python 变声器_Tensorflow中的语音转换/变声器的深度神经网络

最新推荐文章于 2024-06-28 13:39:16 发布

weixin_39724004

最新推荐文章于 2024-06-28 13:39:16 发布

阅读量789

点赞数

文章标签： python 变声器

本文介绍了一种使用非平行数据进行语音转换的项目，目标是将声音转化为著名女演员凯特·温斯莱特的声音。通过两个模块的深度神经网络实现：Net1进行音素分类，Net2进行语音合成。利用CBHG模块捕获序列数据特征，项目中训练和测试的准确率均达到一定水平。提供了训练和转换阶段的步骤以及一些实施技巧。

摘要由CSDN通过智能技术生成

Voice Conversion with Non-Parallel Data

Subtitle: Speaking like Kate Winslet

Samples

Intro

What if you could imitate a famous celebrity's voice or sing like a famous singer? This project started with a goal to convert someone's voice to a specific target voice. So called, it's voice style transfer. We worked on this project that aims to convert someone's voice to a famous English actress Kate Winslet's voice. We implemented a deep neural networks to achieve that and more than 2 hours of audio book sentences read by Kate Winslet are used as a dataset.

Model Architecture

This is a many-to-one voice conversion system. The main significance of this work is that we could generate a target speaker's utterances without parallel data like , or , but only waveforms of the target speaker. (To make these parallel datasets needs a lot of eff

最低0.47元/天解锁文章

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

weixin_39724004

关注关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
python 变声器_Tensorflow中的语音转换/变声器的深度神经网络

Voice Conversion with Non-Parallel DataSubtitle: Speaking like Kate WinsletSamplesIntroWhat if you could imitate a famous celebrity's voice or sing like a famous singer? This project started with a go...
复制链接

扫一扫