什么是Kaldi?
Kaldi is a speech recognition toolkit, freely available under the Apache License.
注意,Kaldi仅仅是一个工具包,不是一个语音识别框架,想做语音识别,框架还要自己写。
这里有一系列ASR开源软件的比较:
https://en.wikipedia.org/wiki/List_of_speech_recognition_software
可以看到Kaldi是唯一一个用DNN做声学模型的。
安装Kaldi很简单,傻瓜化,官网上提供了详尽的帮助。
http://kaldi-asr.org/doc/install.html
重点看一下需要的第三方库:
Software packages installed by Kaldi
The following tools and libraries come with installation scripts in the tools/ directory so you won’t have to install them yourself (note: this is a non-exhaustive list).
- OpenFst: we compile against this and use it heavily. //有限状态机,实现上就是一个有向图,google开发的。
- IRSTLM: this a language modeling toolkit. Som