TensorflowTTS provides real-time state-of-the-art speech synthesis architectures such as Tacotron-2-Tensorflow, Melgan-Tensorflow, Multiband-Melgan-Tensorflow, FastSpeech-Tensorflow based-on TensorFlow 2. With Tensorflow 2, we can speed- up training/inference progress, optimizer further by using fake-quantize aware and pruning , make TTS models can be run faster than real-time and be able to deploy on mobile devices or embedded systems.
What’s new
2020/06/07 (New!) Multi-band MelGAN (MB MelGAN) implementation with Tensorflow is supported.
Features
High performance on Speech Synthesis.
Be able to fine-tune on other languages.
Fast, Scalable and Reliable.
Suitable for deployment.
Easy to implement new model based-on abtract class.
Mixed precision to speed-up training if posible.
Requirements
This repository is tested on Ubuntu 18.04 with:
Python 3.6+
Cuda 10.1
CuDNN 7.6.5
Tensorflow 2.2
Tensorflow Addons 0.9.1
Different Tensorflow version should be working but not tested yet. This repo will try to work with latest stable tensorflow version.
Installation
$ git clone https: //github.com/dathudeptrai/TensorflowTTS.git
$ cd TensorflowTTS
$ pip install .
If you want upgrade the repository and its dependencies:
$ git pull
$ pip install --upgrade .
Supported Model achitectures
TensorflowTTS currently provides the following architectures:
MelGAN released with the paper MelGAN: Generative Adversarial Networks for Conditional Waveform Synthesis by Kundan Kumar, Rithesh Kumar, Thibault de Boissiere, Lucas Gestin, Wei Zhen Teoh, Jose Sotelo, Alexandre de Brebisson, Yoshua Bengio, Aaron Courville.
Tacotron-2 released with the paper Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions by Jonathan Shen, Ruoming Pang, Ron J. Weiss, Mike Schuster, Navdeep Jaitly, Zongheng Yang, Zhifeng Chen, Yu Zhang, Yuxuan Wang, RJ Skerry- Ryan, Rif A. Saurous, Yannis Agiomyrgiannakis, Yonghui Wu.
FastSpeech released with the paper FastSpeech: Fast, Robust and Controllable Text to Speech by Yi Ren, Yangjun Ruan, Xu Tan, Tao Qin, Sheng Zhao, Zhou Zhao, Tie-Yan Liu.
Multi-band MelGAN released with the paper Multi-band MelGAN: