python 变声器_Tensorflow中的语音转换/变声器的深度神经网络

本文介绍了一种使用非平行数据进行语音转换的项目,目标是将声音转化为著名女演员凯特·温斯莱特的声音。通过两个模块的深度神经网络实现:Net1进行音素分类,Net2进行语音合成。利用CBHG模块捕获序列数据特征,项目中训练和测试的准确率均达到一定水平。提供了训练和转换阶段的步骤以及一些实施技巧。
摘要由CSDN通过智能技术生成

Voice Conversion with Non-Parallel Data

Subtitle: Speaking like Kate Winslet

Samples

Intro

What if you could imitate a famous celebrity's voice or sing like a famous singer? This project started with a goal to convert someone's voice to a specific target voice. So called, it's voice style transfer. We worked on this project that aims to convert someone's voice to a famous English actress Kate Winslet's voice. We implemented a deep neural networks to achieve that and more than 2 hours of audio book sentences read by Kate Winslet are used as a dataset.

Model Architecture

This is a many-to-one voice conversion system. The main significance of this work is that we could generate a target speaker's utterances without parallel data like , or , but only waveforms of the target speaker. (To make these parallel datasets needs a lot of eff

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值