Connecting Python to Microsoft Text-to-Speech

X2818566

Article link: https://blog.csdn.net/X2818566/article/details/119412972

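This post shows how to call Microsoft's text-to-speech service from Python: obtain an access token, submit the text as SSML for synthesis, and save the synthesized audio as a .wav file. Several Chinese voices and speaking styles are supported.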
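As a rough illustration of the voice and style settings mentioned above, the sketch below builds an SSML document with the standard library. It assumes the layout described in Azure's SSML documentation, where the speaking style is carried by an `mstts:express-as` element nested inside `voice`. The helper name `build_ssml`, its parameters, and the namespace constants are hypothetical names introduced for this example and are not part of the original code.

```python
from xml.etree import ElementTree

# Namespace URIs assumed from the Azure SSML documentation.
SSML_NS = "http://www.w3.org/2001/10/synthesis"
MSTTS_NS = "https://www.w3.org/2001/mstts"
XML_NS = "http://www.w3.org/XML/1998/namespace"

# Voice name taken from the original listing.
DEFAULT_VOICE = "Microsoft Server Speech Text to Speech Voice (zh-CN, XiaoxiaoNeural)"


def build_ssml(text, voice=DEFAULT_VOICE, style=None, lang="zh-CN"):
    """Return an SSML string for `text`, spoken by `voice` with an optional style."""
    # Serialize SSML elements without a prefix and mstts elements as "mstts:...".
    ElementTree.register_namespace("", SSML_NS)
    ElementTree.register_namespace("mstts", MSTTS_NS)

    speak = ElementTree.Element(f"{{{SSML_NS}}}speak", {"version": "1.0"})
    speak.set(f"{{{XML_NS}}}lang", lang)
    voice_el = ElementTree.SubElement(speak, f"{{{SSML_NS}}}voice", {"name": voice})

    if style:
        # Wrap the text in an express-as element when a style is requested.
        target = ElementTree.SubElement(
            voice_el, f"{{{MSTTS_NS}}}express-as", {"style": style}
        )
    else:
        target = voice_el
    target.text = text
    return ElementTree.tostring(speak, encoding="unicode")
```

Calling `build_ssml('这次的事故要严格总结，防止下次再次发生', style='lyrical')` returns a string that could be sent as the request body with the `application/ssml+xml` content type. Whether a given style is available depends on the chosen voice.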

import http.client, urllib.parse, json
from xml.etree import ElementTree
import wave

apiKey = "YOUR_SUBSCRIPTION_KEY"

# Placeholder request body sent with the token call
params = "hello"
headers = {"Ocp-Apim-Subscription-Key": apiKey}

# Token endpoint for the East Asia region
AccessTokenHost = "eastasia.api.cognitive.microsoft.com"
path = "/sts/v1.0/issueToken"

print("Connect to server to get the Access Token")
conn = http.client.HTTPSConnection(AccessTokenHost)
conn.request("POST", path, params, headers)
response = conn.getresponse()
print(response.status, response.reason)

data = response.read()
conn.close()

# The response body is the token itself, returned as plain text
accesstoken = data.decode("UTF-8")
print("Access Token: " + accesstoken)

# Build the SSML document
body = ElementTree.Element('speak', version='1.0')
body.set('{http://www.w3.org/XML/1998/namespace}lang', 'zh-CN')
voice = ElementTree.SubElement(body, 'voice')
voice.set('{http://www.w3.org/XML/1998/namespace}lang', 'zh-CN')  # language
voice.set('{http://www.w3.org/XML/1998/namespace}style', 'lyrical')  # speaking style
voice.set('{http://www.w3.org/XML/1998/namespace}gender', 'Female')
voice.set('name', 'Microsoft Server Speech Text to Speech Voice (zh-CN, XiaoxiaoNeural)')  # voice
voice.text = '这次的事故要严格总结，防止下次再次发生'  # text to synthesize

# Request WAV (RIFF) output and pass the token as a Bearer credential
headers = {"Content-type": "application/ssml+xml",
           "X-Microsoft-OutputFormat": "riff-24khz-16bit-mono-pcm",
           "Authorization": "Bearer " + accesstoken}

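The summary mentions saving the synthesized audio as a .wav file, a step the visible listing does not reach. Below is a minimal sketch of that step, assuming the response body has already been read into a `bytes` object. With the `riff-24khz-16bit-mono-pcm` output format the payload is expected to be a complete RIFF/WAV container, so it can be written to disk unchanged. `save_wav` is a hypothetical helper introduced here, not the author's code.

```python
import io
import wave


def save_wav(audio_bytes, path):
    """Write a RIFF/WAV payload to `path`.

    Returns (channels, sample_width_in_bytes, frame_rate) read from the header.
    Raises wave.Error if the payload is not a WAV container, for example when
    the service returned an error message instead of audio.
    """
    # Parse the header first so that a bad payload never reaches the disk.
    with wave.open(io.BytesIO(audio_bytes), "rb") as reader:
        info = (reader.getnchannels(), reader.getsampwidth(), reader.getframerate())
    with open(path, "wb") as out:
        out.write(audio_bytes)
    return info
```

Checking the header before writing is a design choice: a failed request then surfaces as an exception rather than as an unplayable file.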
