TensorflowTTS: State-of-the-art real-time speech synthesis implemented in Tensorflow 2

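TensorflowTTS is a real-time speech synthesis framework based on Tensorflow 2 that supports several advanced models, including Tacotron-2 and Melgan. The library offers a fast, scalable and reliable speech synthesis solution suitable for deployment on mobile devices and embedded systems. It is simple to install, works with multiple model architectures, and provides complete tutorials covering data preprocessing, model training and inference.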

TensorflowTTS provides real-time, state-of-the-art speech synthesis architectures such as Tacotron-2-Tensorflow, Melgan-Tensorflow, Multiband-Melgan-Tensorflow and FastSpeech-Tensorflow, all based on Tensorflow 2. With Tensorflow 2, we can speed up training and inference, optimize further with fake-quantization-aware training and pruning, and make TTS models run faster than real time so that they can be deployed on mobile devices or embedded systems.
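
"Faster than real time" is commonly expressed as a real-time factor (RTF): the time spent synthesizing divided by the duration of the audio produced, where a value below 1.0 means synthesis outpaces playback. The sketch below is only an illustration of that arithmetic. The `synthesize` callable and the 22050 Hz sample rate are assumptions made for the example and are not part of the TensorflowTTS API.

```python
import time


def real_time_factor(synthesis_seconds, num_samples, sample_rate):
    """Return synthesis time divided by the duration of the generated audio."""
    audio_seconds = num_samples / float(sample_rate)
    if audio_seconds <= 0:
        raise ValueError("generated audio must have a positive duration")
    return synthesis_seconds / audio_seconds


def measure_rtf(synthesize, text, sample_rate):
    """Time one call to `synthesize` (a hypothetical text -> samples callable)."""
    start = time.perf_counter()
    samples = synthesize(text)
    elapsed = time.perf_counter() - start
    return real_time_factor(elapsed, len(samples), sample_rate)


# Example with made-up numbers: 0.5 s of compute for 1 s of audio at 22050 Hz.
rtf = real_time_factor(0.5, 22050, 22050)
print("RTF = %.2f (%s than real time)" % (rtf, "faster" if rtf < 1.0 else "slower"))
```

In practice the callable passed to `measure_rtf` would wrap the full pipeline (text to mel spectrogram to waveform), and several runs would be averaged after a warm-up call.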

What’s new
2020/06/07 (New!) Multi-band MelGAN (MB MelGAN) implementation with Tensorflow is supported.
Features
High performance on Speech Synthesis.
Can be fine-tuned on other languages.
Fast, Scalable and Reliable.
Suitable for deployment.
Easy to implement new models based on an abstract class.
Mixed precision to speed up training where possible.
Requirements
This repository is tested on Ubuntu 18.04 with:

Python 3.6+
Cuda 10.1
CuDNN 7.6.5
Tensorflow 2.2
Tensorflow Addons 0.9.1
Other Tensorflow versions should work but have not been tested yet. This repo will try to work with the latest stable Tensorflow version.
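
Since only the versions listed above are tested, it can help to compare an environment against them before training. The following is a minimal sketch, not something shipped with the repository: the helper names are made up for illustration, the tested versions are copied from the list above, and `importlib.metadata` requires Python 3.8 or newer (on 3.6 or 3.7 the installed versions would have to be collected another way).

```python
# Tested versions, copied from the requirements list above.
TESTED = {"tensorflow": "2.2", "tensorflow-addons": "0.9.1"}


def parse_version(text):
    """Turn '2.2.0' into (2, 2, 0); stop at the first non-numeric component."""
    parts = []
    for piece in text.split("."):
        if not piece.isdigit():
            break
        parts.append(int(piece))
    return tuple(parts)


def matches_tested(installed, tested):
    """True if `installed` begins with the same components as `tested`."""
    want = parse_version(tested)
    return parse_version(installed)[:len(want)] == want


def report(installed):
    """Map each tested package to 'ok', 'untested (<version>)' or 'missing'."""
    result = {}
    for name, tested in TESTED.items():
        found = installed.get(name)
        if found is None:
            result[name] = "missing"
        elif matches_tested(found, tested):
            result[name] = "ok"
        else:
            result[name] = "untested (%s)" % found
    return result


def installed_versions():
    """Look up installed versions of the tested packages (Python 3.8+)."""
    from importlib import metadata

    found = {}
    for name in TESTED:
        try:
            found[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            pass  # left out of the dict, so report() marks it as missing
    return found
```

Calling `report(installed_versions())` in the target environment returns one status per package. An "untested" result does not mean the version will fail, only that it differs from the configuration listed above.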

Installation
$ git clone https://github.com/dathudeptrai/TensorflowTTS.git
$ cd TensorflowTTS
$ pip install .
If you want to upgrade the repository and its dependencies:

$ git pull
$ pip install --upgrade .
Supported Model Architectures
TensorflowTTS currently provides the following architectures:

MelGAN released with the paper MelGAN: Generative Adversarial Networks for Conditional Waveform Synthesis by Kundan Kumar, Rithesh Kumar, Thibault de Boissiere, Lucas Gestin, Wei Zhen Teoh, Jose Sotelo, Alexandre de Brebisson, Yoshua Bengio, Aaron Courville.
Tacotron-2 released with the paper Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions by Jonathan Shen, Ruoming Pang, Ron J. Weiss, Mike Schuster, Navdeep Jaitly, Zongheng Yang, Zhifeng Chen, Yu Zhang, Yuxuan Wang, RJ Skerry-Ryan, Rif A. Saurous, Yannis Agiomyrgiannakis, Yonghui Wu.
FastSpeech released with the paper FastSpeech: Fast, Robust and Controllable Text to Speech by Yi Ren, Yangjun Ruan, Xu Tan, Tao Qin, Sheng Zhao, Zhou Zhao, Tie-Yan Liu.
Multi-band MelGAN released with the paper Multi-band MelGAN:
