Kaldi 提取MFCC40维的参数设置

1. 数据准备:wave文件,,获取wav.scp,spk2utt,utt2spk三个文件

find /*/16kwav -name '*.wav' | awk -F '/' '{print $NF " " $0}' > ./data/wav.scp

find /*/16kwav -name '*.wav' | awk -F '/' '{print $NF " " $NF}' > ./data/spk2utt

find /*/16kwav -name '*.wav' | awk -F '/' '{print $NF " " $NF}' > ./data/utt2spk

2. 特征提取

首先需要更改conf/mfcc.conf文件参数,更改如下:

# config for high-resolution MFCC features, intended for neural network training.

# Note: we keep all cepstra, so it has the same info as filterbank features,

# but MFCC is more easily compressible (because less correlated) which is why

# we prefer this method.

--use-energy=false       # use average of log energy, not energy.

--sample-frequency=16000 # AISHELL-2 is sampled at 16kHz

--num-mel-bins=40        # similar to Google's setup.

--num-ceps=40            # there is no dimensionality reduction.

--low-freq=20            # low cutoff frequency for mel bins

--high-freq=-400         # high cutoff frequency, relative to Nyquist of 8000 (=7600)

接下来运行如下命令:

utils/fix_data_dir.sh       /*/data
./steps/make_mfcc.sh  /*/data  ./ts_log /*/data/mfcc

 

  • 0
    点赞
  • 3
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值