kaldi monophone training outline

This outline is based on the steps/train_mono.sh script.

Use a subset of the training set to init the monophone model

synopsis

gmm-init-mono [options] Topo-in Dim Model-out Tree-out 

commands

gmm-init-mono "--train-feats=$feats subset-feat --n=10 ark:- ark:-|"\
    $lang/topo $feat_dim $dir/0.mdl $dir/tree

Compiling training graphs

synopsis

compile-train-graphs [options] Tree-in Model-in L-fst-in Trans-rspec Graph-wspec

commands

trans_rspec="ark:sym2int.pl --map-oov $oov_sym -f 2- $lang/words.txt < $sdata/JOB/text|"
out_graphs="ark:|gzip -c>$dir/fsts.JOB.gz"
compile-train-graphs $dir/tree $dir/0.mdl $lang/L.fst $trans_rspec $out_graph

Align data equally

synopsis

align-equal-compiled Graph-rspec Feats-rspec Align-wspec

gmm-acc-stats-ali [options] Model-in Feat-rspec Align-rspec Stats-out

commands

graph="ark:gunzip -c $dir/fsts.JOB.gz|"
align-equal-compiled $graph "$feats" ark,t:- | \
gmm-acc-stats-ali --binary=true $dir/0.mdl "$feats" ark:-

Estimate GMM with equally aligned frames

synopsis

gmm-est [options] Model-in Stats-in Model-out

commands

gmm-est [options] $dir/0.mdl "gmm-sum-accs - $dir/0.*.acc|" $dir/1.mdl

Alternatively align data and update model

  • align data according to $realign_iters
  • increase the amount of GMMs until $max_iter_inc, with (totgauss - numgauss) / max_iter_inc each iteration
  • update GMMs in each iteration
  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值