A Roundup of Speech Recognition Technology Resources

1. Conferences

1.1 Top International Conferences

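ICASSP: International Conference on Acoustics, Speech and Signal Processing. Held annually; submissions are due in October and the conference takes place the following May. It is the flagship international conference on acoustics, speech, and signal processing: one of the most authoritative venues in signal processing, the top academic conference for acoustics and speech signal processing, and also a respected venue for image and video signal processing.
ICSLP: International Conference on Spoken Language Processing. Held in even-numbered years; submissions are due in April and the conference takes place in September.
EuroSpeech: European Conference on Speech Communication and Technology. Held in odd-numbered years; submissions are due in April and the conference takes place in September.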
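The proceedings of Interspeech 2019 (Interspeech is the annual ISCA conference that succeeded ICSLP and EuroSpeech) are available at: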
https://www.isca-speech.org/archive/Interspeech_2019/

1.2 Other Conferences

ICSMC: International Conference on Systems, Man, and Cybernetics
NAECON:National Aerospace and Electronics Conference
ICTTA:International Conference on Telecommunication Technology and Applications
ISSPA: Information Sciences, Signal Processing and their Applications
ISPACS:International Symposium on Intelligent Signal Processing and Communications Systems
SBEC:Southern Biomedical Engineering Conference
ICAPR:International Conference on Advances in Pattern Recognition
ICOSP: International Conference on Signal Processing Proceedings
ICSLP: International Conference on Spoken Language Processing
ICICIC:International Conference on Innovative Computing, Information and Control
IEMBS: Annual International Conference of the IEEE Engineering in Medicine and Biology Society
NLPKE: Natural Language Processing and Knowledge Engineering
IECON:Conference of the IEEE Industrial Electronics Society
ICCT: International Conference on Communication Technology
ASRU:Automatic Speech Recognition and Understanding
ISCAS:International Symposium on Circuits and Systems
ICDSP:International Conference on Digital Signal Processing
SPAWC: Signal Processing Advances in Wireless Communications
ICCSIT: International Conference on Computer Science and Information Technology
ICSE: International Conference on Software Engineering
ICIAS:International Conference on Intelligent and Advanced Systems
TENCON: IEEE Region 10 Conference
ICFCC:International Conference on Future Computer and Communication
WCICA:World Congress on Intelligent Control and Automation
MMSP: International Workshop on Multimedia Signal Processing
IROS: International Conference on Intelligent Robots and Systems
ICSDA: International Conference on Speech Database and Assessments
ICCCE:International Conference on Computer and Communication Engineering
Other conferences include ISPA, ASPAA, INDICO, and NetCom, among others.

2. Journals

2.1 Chinese Journals

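声学学报 (Acta Acustica)
应用声学 (Journal of Applied Acoustics)
声学工程 (Acoustic Engineering)
信号处理 (Journal of Signal Processing)
电子学报 (Acta Electronica Sinica)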

2.2 International Journals

IEEE Signal Processing Magazine (IF: 2.655; bimonthly, six issues per year)
Computer Speech and Language (CSL) (IF: 1.776)
Digital Signal Processing (IF: 0.889)
IEE Electronics Letters (IF: 1.063)
IEEE Signal Processing Letters (SPL) (IF: 0.722)
IEEE Transactions on Audio, Speech and Language Processing (IF: 2.950)
IEEE Transactions on Circuits and Systems-II: Express Briefs (CAS-II) (IF: 0.922)
IEEE Transactions on Signal Processing (TSP) (IF: 1.57)
IEEE Transactions on Circuits and Systems-I: Regular Papers (CAS-I) (IF: 1.139)
IET Signal Processing (IF: 1.250)
Signal Processing (IF: 0.669)
Signal Processing: Image Communication (IF: 1.109)
Speech Communication (IF: 1.585)
Here IF denotes the journal's impact factor.

3. International Speech Recognition Research Institutions

AT&T  http://www.research.att.com/editions/201304_home.html
ATR    http://www.slt.atr.co.jp/index.html
BBN    http://www.bbn.com/technology/speech_recognition/
Cambridge University Engineering Department (CUED) http://mi.eng.cam.ac.uk/
Carnegie Mellon University (CMU) 
HP Labs   http://www.hpl.hp.com/
Columbia University 
Centre for Speech Technology Research at Edinburgh University 
ESAT - PSI Speech Group at K.U.Leuven 
International Computer Science Institute (ICSI) 
IBM Human Language Technologies     http://www.research.ibm.com/hlt/
IDIAP Research Institute 
INESC-ID Lisboa, Spoken Language Systems Lab 
IRST 
ISIP 
Johns Hopkins University (CLSP) 
Speech, Music and Hearing at KTH 
LIMSI 
Alcatel Lucent (Bell Labs)  http://www.alcatel-lucent.com/wps/portal/BellLabs
Microsoft    http://research.microsoft.com/en-us/groups/speech/
MIT Spoken Language Systems 
Oregon Graduate Institute (OGI) Center for Spoken Language Understanding 
Speech and Language Processing Laboratory at Rutgers University 
RWTH Aachen 
University of Colorado, Boulder (CLEAR) 
University of Sheffield 
SRI 
Furui Laboratory, Tokyo Institute of Technology 
University of Illinois at Urbana and Champaign 
University of Washington 
Universitaet Erlangen-Nürnberg
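University of Cambridge (HTK) http://htk.eng.cam.ac.uk/
Carnegie Mellon University (CMU) http://www.speech.cs.cmu.edu/
Jyh-Shing Roger Jang (张智星), speech recognition and machine learning http://mirlab.org/jang/
iFLYTEK (科大讯飞, Anhui) http://www.iflytek.com/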

4. International Speech Recognition Evaluations

NIST Spoken Language Technology Evaluations Benchmark Tests
(http://www.nist.gov/speech/tests/index.htm)

5. Speech Recognition Toolkits

AT&T FSM Library
CMU-Cambridge Statistical LM Toolkit
CMU Sphinx
CSLU toolkit
CUED HTK
Edinburgh Speech Tools Library
KTH WaveSurfer
MSState ASR Toolkit
NIST Utility Software
SPRACHcore software package
SRI Language Modelling Toolkit
SoX – Sound eXchange
Transcriber
UCL Speech Filing System
FBVIEW multi-channel audio file viewer

Speech Recognition Websites and Forums

http://www.voxforge.org/home/forums/message-boards/acoustic-model-discussions
http://bbs.matwav.com
http://www.yuyinshibie.com/
http://www.ctiforum.com/voice.html
http://liceu.uab.es/~joaquim/phonetics/fon_anal_acus/herram_anal_acus.html
http://www.phon.ucl.ac.uk/resource/scribe/

6. Home Pages and Blogs

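1. Bill Xia's blog: http://ibillxia.github.io/blog/categories/assp/ Includes some deep learning material; useful.

2. zouxy09's blog: http://blog.csdn.net/zouxy09/article/category/1218766 zouxy09 works on both deep learning and machine learning, and the posts are of high quality.

3. Home page of Prof. Jyh-Shing Roger Jang (张智星), Taiwan: http://mirlab.org/jang/ Includes a speech course, Audio Signal Processing and Recognition.

4. The CMU speech group: http://www.speech.cs.cmu.edu/ Contains many links.

5. Prof. Dan Ellis's home page: http://www.ee.columbia.edu/~dpwe/ Contains many toolboxes.

6. Dan Povey's home page: http://www.danielpovey.com/index.html Has a lot of Kaldi material.

7. Home page of Li Deng (Microsoft): http://research.microsoft.com/en-us/people/deng/ Material on deep learning for speech recognition.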

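8. Prof. DeLiang Wang's home page (Ohio State University): http://www.cse.ohio-state.edu/~dwang/pnl/software.html Software for speech recognition, speech separation, and music separation.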

9. GitHub page of SnippyHolloW: https://github.com/SnippyHolloW

10. A natural language processing forum: http://www.threedweb.cn/portal.php A great many resources.

Speech Recognition and Synthesis
Speech at Carnegie Mellon University
The renowned CMU speech group, birthplace of the well-known Sphinx system and the place where Kai-Fu Lee did his research.
http://fife.speech.cs.cmu.edu/
The Center for Language and Speech Processing (CLSP) at The Johns Hopkins University
The language and speech processing group led by the renowned Prof. Jelinek.
http://www.clsp.jhu.edu/
Speech Research
A very comprehensive collection of links on speech technology research.
http://mambo.ucsc.edu/psl/speech.html
Signal Compression Lab, Department of Electrical and Computer Engineering
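The lab of the renowned Prof. Allen Gersho at the University of California, Santa Barbara. It includes several outstanding professors, such as K. Rose and V. Cuperman. The lab is especially respected because many of its graduates have gone on to become leading figures in academic research.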
http://scl.ece.ucsb.edu/index.htm
The Speech Recognition Group
The speech recognition group under the CAIP center at Rutgers University. R. P. Ramachandran, of the collection Modern Methods of Speech Processing, is a professor at this center.
http://www.caip.rutgers.edu/ARPA-SLT
Speech Processing Laboratory at Michigan State University
The speech processing research group led by the renowned Prof. Deller.
http://www.egr.msu.edu/~deller/speechlab_people.html
Purdue University Speech and Language Processing Research Group
The speech processing research group at Purdue University.
http://wavelet.ecn.purdue.edu/~speechg
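Another well-known speech research group is that of Prof. Keiichi Tokuda at the Nagoya Institute of Technology, Japan.
The group is highly regarded in parametric speech synthesis, and the HTS platform it developed is now widely used.
It should be helpful to anyone working on speech synthesis or speech recognition. The address is: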
http://www.sp.nitech.ac.jp/
The HTS home page is:
http://hts.sp.nitech.ac.jp/

Speech recognition toolboxes:

1. Kaldi: http://kaldi.sourceforge.net/

2. HTK: http://htk.eng.cam.ac.uk/

3. RWTH ASR: http://www-i6.informatik.rwth-aachen.de/rwth-asr/

4. Sphinx: http://cmusphinx.sourceforge.net/

5. Julius: http://julius.sourceforge.jp/en_index.php

Speaker recognition:

1. Microsoft's open-source library, MSR Identity Toolkit v1.0: http://research.microsoft.com/en-us/downloads/a6262fec-03a7-4060-a08c-0b0d037a3f5b/

2. Prof. DeLiang Wang's home page also has speaker recognition material: http://www.cse.ohio-state.edu/~dwang/pnl/software.html