papers
Grace_yanyanyan
The LeVoice Far-field Speech Recognition System for VOiCES from a Distance Challenge 2019
Yulong Liang, Lin Yang, Xuyang Wang, Yingjie Li, Chen Jia, Junjie Wang. Lenovo Research. Liangyl3@lenovo.com. The focus is on...
Translation · 2020-02-12 22:54:55 · 486 views · 0 comments
Multi-channel Acoustic Modeling using Mixed Bitrate OPUS Compression
Authors: Aparna Khare, Minhua Wu. Link: https://arxiv.org/abs/2002.00122 Recent literature has shown that a learned...
Translation · 2020-02-04 16:35:11 · 201 views · 0 comments
Single Channel Speech Enhancement Using Temporal Convolutional Recurrent Neural Networks
Authors: Jingdong Li, Changliang Li. Link: https://arxiv.org/abs/2002.00319 Jingdong Li∗ Hui Zha...
Translation · 2020-02-04 13:27:43 · 766 views · 0 comments
Self-Attentive Speaker Embeddings for Text-Independent Speaker Verification
Yingke Zhu¹, Tom Ko², David Snyder³, Brian Mak¹, Daniel Povey³. ¹Department of Computer Science & Engineering, The Hong Ko...
Translation · 2020-02-04 01:13:33 · 2063 views · 1 comment
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
Daniel S. Park∗, William Chan, Yu Zhang, Chung-Cheng Chiu, Barret Zoph, Ekin D. Cubuk, Quoc V. Le. Google Brain. {danielsp...
Translation · 2020-02-01 23:55:48 · 4087 views · 0 comments
2018--Analysis of Length Normalization in End-to-End Speaker Verification System
Weicheng Cai², Jinkun Chen², Ming Li¹. ¹Data Science Research Center, Duke Kunshan University, Kunshan, China. ²School of Electronics and Information Technology, Sun Yat-sen University, Guangzhou, China...
Translation · 2019-12-20 16:33:21 · 409 views · 0 comments
2019-Utterance-Level End-to-End Language Identification Using Attention-Based CNN-BLSTM--ICASSP 2019
Weicheng Cai¹,², Danwei Cai¹, Shen Huang³ and Ming Li¹∗. ¹Data Science Research Center, Duke Kunshan University, Kunshan, China. ²School of Electronics and Information Technology, Sun Yat-sen University, ...
Translation · 2019-12-20 16:09:41 · 344 views · 0 comments
2019-SPEAKER RECOGNITION FOR MULTI-SPEAKER CONVERSATIONS USING X-VECTORS
David Snyder, Daniel Garcia-Romero, Gregory Sell, Alan McCree, Daniel Povey, Sanjeev Khudanpur. Center for Language and Speech Proce...
Translation · 2019-12-20 12:35:37 · 511 views · 0 comments
20191220--Paper abstracts
[5] LSTM-TDNN with convolutional front-end for Dialect Identification in the 2019 Multi-Genre Broadcast Challenge. Authors: Xiaoxiao Miao, Ian McLoughlin. Link: https://...
Translation · 2019-12-20 10:53:48 · 195 views · 0 comments
201912--End-to-end training of time domain audio separation and recognition
[2] End-to-end training of time domain audio separation and recognition. Authors: Thilo von Neumann, Reinhold Haeb-Umbach. Notes: 5 pages, 1 figure, to appear in ICASSP 2020. Link: https://arxiv.o...
Translation · 2019-12-19 13:16:09 · 1246 views · 0 comments
201912--Data augmentation approaches for improving animal audio classification
Authors: Loris Nanni, Michelangelo Paci. Link: https://arxiv.org/abs/1912.07756 This paper applies different data augmentation techniques in the training of convolutional neural networks (CNNs) and proposes a set of...
Translation · 2019-12-18 13:26:51 · 2301 views · 0 comments
2017--Deep Neural Network Embeddings for Text-Independent Speaker Verification
http://www.danielpovey.com/files/2017_interspeech_embeddings.pdf David Snyder, Daniel Garcia-Romero, Daniel Povey, Sanjeev Khud...
Translation · 2019-12-09 10:50:57 · 1144 views · 0 comments
2019-On Investigation Of Unsupervised Speech Factorization Based On Normalization Flow
2019.10. Haoran Sun, Yunqi Cai, Lantian Li, Dong Wang. CSLT, Tsinghua University, China. Abstract: The speech signal is speech content, speaker characteristics, channel effects, and many other...
Translation · 2019-12-11 12:33:03 · 236 views · 0 comments
2019-Gaussian-Constrained Training For Speaker Verification
2019.2. Lantian Li, Zhiyuan Tang, Ying Shi, Dong Wang. Center for Speech and Language Technologies, RIIT, Tsinghua University, China. Beijing Nation...
Translation · 2019-12-11 12:32:40 · 467 views · 1 comment
2018-Human And Machine Speaker Recognition Based On Short Trivial Events
Miao Zhang¹,², Xiaofei Kang¹,³, Yanqing Wang¹,², Lantian Li¹, Zhiyuan Tang¹, Haisheng Dai⁴, Dong Wang¹∗. Center for Speech and Langu...
Translation · 2019-12-11 12:30:33 · 153 views · 0 comments
2017-Speaker Recognition with Cough, Laugh and “Wei”
Miao Zhang∗†, Yixiang Chen∗, Lantian Li∗ and Dong Wang∗. ∗Center for Speech and Language Technologies (CSLT), RIIT, Tsinghua University. Tsinghua National ...
Translation · 2019-12-11 12:29:46 · 195 views · 0 comments
2019--VoxSRC 2019: The First VoxCeleb Speaker Recognition Challenge
Joon Son Chung¹,², Arsha Nagrani¹, Ernesto Coto¹, Weidi Xie¹, Mitchell McLaren³, Douglas A Reynolds⁴ and Andrew Zisserman¹. ¹Visual Geometry...
Translation · 2019-12-11 12:27:15 · 1328 views · 0 comments