open source

28 篇文章 0 订阅

speech recognition:

1.kladi

This is one of the newer speech recognition tool kits, but it has made a name for itself fast! Development began in 2009 at a workshop at John Hopkins University called “Low Development Cost, High Quality Speech Recognition for New Languages and Domains”.

After working on the project for a couple of years, the code for Kaldi was released on May 14, 2011. Kaldi quickly gained a reputation for its ease to work with.

Daniel Povey, who was one of the original developers, still maintains and updates Kaldi, so don’t expect this toolkit to go stale anytime soon. Here are all the resources you’ll need for Kaldi:

2.CMUSphinx

CMUSphinx, or called Sphinx for short, is actually a group of speech recognition systems developed by the Carnegie Mellon University.  There are several packages, each designed for different tasks and applications.

One of these includes Pocketsphinx, which is a version of sphinx that can be used in embedded systems. Take a look at the resources below for everything you need to know regarding Sphinx:

3.HTK

Hidden Markov Model Toolkit (HTK) was made for handling HMMs. HMM is a statistical parametric synthesis technique. While HTK is mainly used for speech recognition, it can also be used for text-to-speech and for DNA sequencing.

HTK was developed at the Machine Learning Laboratory in the Cambridge University Engineering Department. Today, Microsoft has the copyright to the original HTK code. However, changes to the source code are encouraged by Microsoft.

New versions of HTK are released on a consistently, with the latest release in December 2015.

4.Simon

Simon is a speech recognition toolkit that provides an easy-to-use user interface. The simple structure and friendly user-interface are some of Simon’s biggest strengths. Simon actually uses CMUSphinx, HTK, and Julius (mentioned below) as the foundation of their toolkit.

Simon is known as a popular speech recognition tool for Linux, although it can also work with Windows.

5.Julius

Julius is a two-pass large vocabulary continuous speech recognition (LVCSR) engine. Born in 1997, Julius continues to be developed by the Interactive Speech Technology Consortium.

Currently, Japanese is the only language model that’s fully available with Julius. A sample English acoustic model is available, but cannot be used for commercial purposes. The VoxForge-Project is working on creating an English language acoustic model for Julius.

Machine Translation

1.OpenNMT

2.tensorflow

3.fairseq

4.Moses

5.THUMT "THUMT: An Open Source Toolkit for Neural Machine Translation"

6.sockeys https://github.com/awslabs/sockeye

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值