python实现百度语音之语音识别

最新推荐文章于 2024-08-28 19:22:22 发布

zhouxiaoxiao2015

最新推荐文章于 2024-08-28 19:22:22 发布

阅读量1.6w

点赞数 13

分类专栏： python 语音识别文章标签： python-语音

本文链接：https://blog.csdn.net/zwy8831/article/details/54406087

版权

python 同时被 2 个专栏收录

2 篇文章 0 订阅

订阅专栏

语音识别

1 篇文章 0 订阅

订阅专栏

这篇文字是基于前辈分享的基础上写出来的。
前辈在这里：
http://blog.sina.com.cn/s/blog_7cedb56d0102vb5p.html
http://blog.csdn.net/wolfblood_zzx/article/details/46418635
http://blog.csdn.net/happen23/article/details/45821697

我这里只使用了requests库，相对来说简单点：

#-*- coding: utf-8 -*-
import base64, requests
d = open('testxf.pcm', 'rb').read()
data = {
    "format": "pcm",
    # "format": "wav",
    "rate": 16000,
    "channel": 1,
    "token": "your token",
    "cuid": "your mac",
    "len": len(d),
    "speech": base64.encodestring(d).replace('\n', '')
}
result = requests.post('http://vop.baidu.com/server_api', json=data, headers={'Content-Type': 'application/json'})
data_result = result.json()
print data_result['err_msg']
if data_result['err_msg']=='success.':
    print "语音结果：" + data_result['result'][0].encode('utf-8')
else:
    print data_result

这里获取TOKEN的代码，就没贴出来了，可以参考前辈的代码。
特别注意：”speech”: base64.encodestring(d).replace(‘\n’, ”)，这也是参考一位前辈的指导，需要去掉最后的回车（不好意思没记住前辈的地址）。
另外，我测试了2个自己的WAV文件，清晰的可以识别，噪音比较大的识别不了。后面研究下降噪，看能不能识别出来。