Speech Synthesis
- TTS Synthesis with Bidirectional LSTM based Recurrent Neural Networks
- WaveNet: A Generative Model for Raw Audio
- SampleRNN: An Unconditional End-to-End Neural Audio Generation Model
- Char2Wav: End-to-End Speech Synthesis
- Deep Voice: Real-time Neural Text-to-Speech
- Parallel WaveNet: Fast High-Fidelity Speech Synthesis
- Statistical Parametric Speech Synthesis Using Generative Adversarial Networks Under a Multi-task Learning Framework
- Tacotron: Towards End-to-End Speech Synthesis
- VoiceLoop: Voice Fitting and Synthesis via a Phonological Loop
- Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions
- Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
- Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning
- ClariNet: Parallel Wave Generation in End-to-End Text-to-Speech
- LPCNet: Improving Neural Speech Synthesis Through Linear Prediction
- Neural Speech Synthesis with Transformer Network
- Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment Search
- Flow-TTS: A Non-Autoregressive Network for Text to Speech Based on Flow
- Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
- PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS
Speech Recognition
- A Neural Probabilistic Language Model
- Recurrent Neural Network Based Language Model
- LSTM Neural Networks for Language Modeling
- Hybrid Speech Recognition with Deep Bidirectional LSTM
- Attention Is All You Need
- Improving Language Understanding by Generative Pre-Training
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
- Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context
- Feedforward Sequential Memory Networks: A New Structure to Learn Long-term Dependency
- Convolutional, Long Short-Term Memory, Fully Connected Deep Neural Networks
- Highway Long Short-Term Memory RNNs for Distant Speech Recognition