This page contains collaboratively developed documentation for the CMU Sphinx speech recognition engines.
Start User Documentation
This section contains links to documents that describe how to use Sphinx to recognize speech. We currently have very few end-user tools, so this section may remain sparse for the foreseeable future.
CMUSphinx Tutorial For Developers: Getting started with CMUSphinx for developers
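- Basic concepts of speech
- Overview of the CMUSphinx toolkit
- Before you start
- Building an application with pocketsphinx
- Building an application with sphinx4
- Building a language model
- Adapting an existing acoustic model
- Building an acoustic model
- Building a dictionary
- Using pocketsphinx on Android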
If you run into trouble, read the Frequently Asked Questions (FAQ).
See also these additional documents:
- Decoder Versions: Description of the software packages
- Download Details: How to obtain CMUSphinx packages
- How to get help and discuss things
- http://cmusphinx.sourceforge.net/doc/speech.ppt Cool presentation done by Heather Dewey-Hagborg
To find out where CMUSphinx is used, see:
- Projects that use Sphinx: These projects, both commercial and free, use Sphinx in one form or another
Advanced User Documentation
These documents either describe a particular aspect of the Sphinx codebase in detail or serve as a developer's guide to accomplishing a particular task.
- Building on iPhone: Building Pocketsphinx on various platforms
- Integrating CMUSphinx with Telephony Servers - Asterisk and Freeswitch: How to use pocketsphinx in Asterisk
- The Incomplete Guide to Sphinx-3 Performance Tuning: How to tune the decoder to be fast (or rather, not horribly slow)
- Optimizing pocketsphinx for embedded devices
- Phoneme Recognition (caveat emptor): How to use Sphinx3 for phoneme recognition
- Segmentation and Diarization using LIUM tools: Using LIUM tools for speech segmentation and speaker diarization
- Training an acoustic model with LDA and MLLT feature transforms: How to train acoustic models with LDA and MLLT feature transforms
- Using PocketSphinx with GStreamer and Python (or Vala): How to use PocketSphinx with GStreamer and Python
- InstallingPythonStuff: How to install Python and the modules needed for SphinxTrain development
- MMIE Training in SphinxTrain: How to perform MMIE training
- http://www.speech.cs.cmu.edu/sphinx/tutorial.html Robust Group Tutorial (classic tutorial from the CMU Speech Group website)
Decoder Space
- Sphinx4 Space: Information about sphinx4, design, code, performance, history.
Reference
These documents describe the APIs in excruciating detail or provide other useful background information for CMUSphinx developers.
- Generated API documentation for PocketSphinx
- Generated API documentation for SphinxBase
- ePyDoc documentation for the SphinxTrain Python modules
- JavaDocs for Sphinx4
Developer Documentation
This section contains internal information for CMUSphinx developers, though we hope other readers will find it useful as well.
- Sphinx-4 Regression Tests: How to run regression tests
- Layout of SphinxTrain code: An overview of the SphinxTrain source code for researchers and developers
- CMUCLMTK development: Development guide for the CMU-Cambridge Language Modeling Toolkit
- Language Features for SphinxBase, SphinxThree, and SphinxTrain
- Upcoming CMU Sphinx Software Releases: Plans for upcoming releases of Sphinx
- Release Check List: How to make a release
- Web Site Layout: How to organize information
File formats:
- Acoustic model format
- MFC files
Data sources:
- Data sources
Materials for GSOC
- Information for Students: Student information
- Tasks for Summer Of Code Projects: Ideas for students
GSoC Previous Years
- Google Summer of Code 2012 Projects
Speech Recognition Theory
This section collects research ideas for specific problems in speech recognition.
- Lattices
- WFST
- Search algorithms
- Language models
- Features
- Noise robustness
- Adaptation