转-语音识别相关资料

国际最顶尖会议:
ICASSP:International Conference on Acoustics, Speech and Signal Processing
ICSLP:International Conference on Semiconductor Laser and Photonics
EUROSPEECH:European Conference on Speech Communication and Technology
其他
ICSMC:Int l Conference on Systems, Man & Cybernetics
NAECON:National Aerospace and Electronics Conference
ICTTA:International Conference on Telecommunication Technology and Applications
ISSPA: Information Sciences, Signal Processing and their Applications
ISPACS:International Symposium on Intelligent Signal Processing and Communications Systems
SBEC:Southern Biomedical Engineering Conference
ICAPR:International Conference on Advances in Pattern Recognition
ICOSP: International Conference on Signal Processing Proceedings
ICSLP: International Conference on Spoken Language Processing
ICICIC:International Conference on Innovative Computing, Information and Control
IEMBS:Institute of Electrical and Electronics Engineers
NLPKE: Natural Language Processing and Knowledge Engineering
IECON:Conference of the IEEE Industrial Electronics Society
ICCT:International Council on Clean Transportation
ASRU:Automatic Speech Recognition and Understanding
ISCAS:International Symposium on Circuits and Systems
ISPACS:International Symposium on Intelligent Signal Processing and Communications Systems
ICDSP:International Conference on Digital Signal Processing
SPAWC:signal processing advances in wireless communications
ICCSIT: International Conference on Computer Science and Information Technology
ICSE: International Conference on Software Engineering
ICIAS:International Conference on Intelligent and Advanced Systems
TENCON:Technical Environmental Consulting
ICFCC:International Conference on Future Computer and Communication
WCICA:World Congress on Intelligent Control and Automation
MMSP:international workshop on multimedia signal processing
IROS: Intelligent Robots and Systems
ICSDA: INTERNATIONAL COMBATIVES SELF DEFENSE ASSOCIATION
ICCCE:International Conference on Computer and Communication Engineering
其他的会议还有:ISPA,ASPAA,INDICO,NetCom等

期刊方面:
国内:信号处理,电子学报、声学学报,应用声学,声学工程
国外:最著名的:IEEE Signal Processing Magazine (IF:2.655),一年6期,是双月刊)
还有IEEE Transactions on Signal Processing (TSP)– (IF:1.57)
IEEE Transactions on Circuits and Systems-I: Regular Papers (CAS-I)—(IF:1.139)
Signal Processing: Image Communication (IF: 1.109)
IEE Electronics Letters (IF:1.063)
IEEE Transactions on Circuits and Systems-II: Express Briefs (CAS-II)—(IF:0.922)
Digital Signal Processing(IF: 0.889)
IEEE Signal Processing Letters (SPL)—(IF: 0.722)
Signal Processing (IF: 0.669)
IET Signal Processing
其中IF为影响因子

1.1 国际语音识别技术研究机构
AT&T http://www.research.att.com/editions/201304_home.html
ATR http://www.slt.atr.co.jp/index.html
BBN http://www.bbn.com/technology/speech_recognition/
Cambridge University Engineering Department (CUED) http://mi.eng.cam.ac.uk/
Carnegie Mellon University (CMU)
HP Labs http://www.hpl.hp.com/
Columbia University
Centre for Speech Technology Research at Edinburgh University
ESAT - PSI Speech Group at K.U.Leuven
International Computer Science Institute (ICSI)
IBM Human Language Technologies http://www.research.ibm.com/hlt/
IDIAP Research Institute
INESC-ID Lisboa, Spoken Language Systems Lab
IRST
ISIP
Johns Hopkins University (CLSP)
Speech, Music and Hearing at KTH
LIMSI
Alcatel Lucent (Bell Labs) http://www.alcatel-lucent.com/wps/portal/BellLabs
Microsoft http://research.microsoft.com/en-us/groups/speech/
MIT Spoken Language Systems
Oregon Graduate Institute (OGI) Center for Spoken Language Understanding
Speech and Language Processing Laboratory at Rutgers University
RWTH Aachen
University of Colorado, Boulder (CLEAR)
University of Sheffield
SRI
Furui Laboratory, Tokyo Institute of Technology
University of Illinois at Urbana and Champaign
University of Washington
Universitaet Erlangen-Nürnberg

剑桥大学
http://htk.eng.cam.ac.uk/

CMU大学
http://www.speech.cs.cmu.edu/

张智星 语音识别,机器学习
http://mirlab.org/jang/
安徽科大讯飞
http://www.iflytek.com/

1.2 国际语音识别技术期刊
(1)Speech Communication
(2)Computer Speech and Language (CSL)
(3)IEEE Transactions on Speech and Audio Processing

1.3 国际语音识别技术会议
(1)ICASSP(International Conference on Acoustic, Speech and Signal Processing)
每年一届,10月截稿,次年5月开会。
(2)ICSLP(International Conference on Spoken Language Processing)
偶数年举办,4月截稿,9月开会。
(3)EuroSpeech:奇数年举办,4月截稿,9月开会。

1.4 国际语音识别技术评测
NIST Spoken Language Technology Evaluations Benchmark Tests
(http://www.nist.gov/speech/tests/index.htm)

1.5 语音识别技术工具包
AT&T FSM Library
CMU-Cambridge Statistical LM Toolkit
CMU Sphinx
CSLU toolkit
CUED HTK
Edinburgh Speech Tools Library
KTH WaveSurfer
MSState ASR Toolkit
NIST Utility Software
SPRACHcore software package
SRI Language Modelling Toolkit
SoX – Sound eXchange
Transcriber
UCL Speech Filing System
FBVIEW multi-channel audio file viewer

1.6语音识别网站及相关论坛

http://www.voxforge.org/home/forums/message-boards/acoustic-model-discussions
http://bbs.matwav.com
http://www.yuyinshibie.com/
http://www.ctiforum.com/voice.html
http://liceu.uab.es/~joaquim/phonetics/fon_anal_acus/herram_anal_acus.html
http://www.phon.ucl.ac.uk/resource/scribe/

  • 1
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
深度学习-语音识别是一种利用神经网络模型进行语音识别的技术。Python语言在深度学习领域有广泛应用,并提供了多个相关框架,如TensorFlow和PyTorch,便于实现语音识别任务。 在进行深度学习-语音识别实战时,首先需要准备语音数据集。可以使用已有的公开数据集,也可以自己收集。常用的数据集包括LibriSpeech、TIMIT和Aurora4等。 接下来,需要将语音数据集进行预处理。预处理步骤包括数据清洗、标签处理和特征提取。数据清洗可以剔除噪音和异常数据,标签处理主要是将语音数据与对应的文字标签进行对齐,特征提取则是将语音信号换为机器学习算法可以处理的数值特征,常用的特征提取方法有Mel频谱特征和MFCC等。 然后,需要选择合适的神经网络模型进行语音识别。常用的模型包括基于卷积神经网络(CNN)和循环神经网络(RNN)的混合模型。模型的选择要根据数据集的规模和任务需求进行。 在实际实现中,可以利用Python的深度学习框架(如TensorFlow和PyTorch)来构建和训练神经网络模型。框架提供了高级的API和函数,可以方便地定义模型结构、进行参数优化和模型评估。 训练完成后,可以使用训练好的模型进行语音识别。使用Python进行语音识别时,可以使用语音信号进行预测,并得到相应的文本结果。 总之,深度学习-语音识别实战是一个复杂的任务,但借助Python的强大生态系统和深度学习框架,可以更高效地进行模型训练和识别任务的实现。

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值