声明:转发本文请联系博主,并标明出处
语音合成技术近几年都有哪些论文呢?
我们整理了近6年的语音合成论文集分享给大家,希望可以为大家在深耕语音合成领域的过程中,提供绵薄助力。论文集按照年份和引用量列出。
文中加粗数字代表论文引用量,引用量由少及多排序。
2019年
1.111-ClariNet Parallel Wave Generation in End-to-End Text-to-Speech
2.115-Speech synthesis from neural decoding of spoken sentences
3.171-Waveglow A Flow-based Generative Network for Speech Synthesis22018年
1.62-VoiceLoop Voice Fitting and Synthesis via a Phonological Loop
2.84-Style Tokens Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
3.105-Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention
4.112-Deep Voice 3 2000-Speaker Neural Text-to-Speech
5.133-Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron
6.153-A Survey on Automatic Detection of Hate Speech in Text
7.159-Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis
8.294-Deep Voice 2 Multi-Speaker Neural Text-to-Speech
9.294-Parallel WaveNet Fast High-Fidelity Speech Synthesis
10.319-Audio Adversarial Examples Targeted Attacks on Speech-to-Text
11.2018-593-Natural TTS Synthesis by Conditioning Wavenet on MEL Spectrogram Predictions
2017年
1.98-Generative adversarial network-based postfilter for statistical parametric speech synthesis
2.108-ASVspoof The Automatic Speaker Verification Spoofing and Countermeasures Challenge
3.132-Statistical Parametric Speech Synthesis Incorporating Generative Adversarial Networks
4.133-Montreal Forced Aligner Trainable Text-Speech Alignment Using Kaldi.
5.244-Char2Wav End-to-End Speech Synthesis
6.291-Deep Voice Real-time Neural Text-to-Speech
7.449-Tacotron Towards End-to-End Speech Synthesis
2016年
1.58-Fast, Compact, and High Quality LSTM-RNN Based Statistical Parametric Speech Synthesizers for Mobile Devices
2.78-Listen and Translate A Proof of Concept for End-to-End Speech-to-Text Translation
3.79-What sound symbolism can and cannot do Testing the iconicity of ideophones from five languages
4.85-Advances in phase-aware signal processing in speech communication
5.99-D4C, a band-aperiodicity estimator for high-quality speech synthesis
6.121-Investigating gated recurrent networks for speech synthesis
7.239-Deep Voice 2 Multi-Speaker Neural Text-to-Speech
8.523-WORLD A vocoder-based high-quality speech synthesis system for real-time applications