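Original: Speaker Recognition Model (GMM-UBM)
1. Why can a voiceprint be used to tell different people apart? Every person's voice has unique characteristics, which are determined by two factors: (1) the size of the vocal cavities, and (2) the way the speech organs are manipulated (for example, the muscle movements of the vocal cords). These factors make each voice unique. 2. Briefly describe the speaker recognition pipeline. First extract features, then train the model, and finally score and make a decision. Feature extraction includes pre-emphasis, framing and windowing, a Fourier transform to obtain the spectrogram, and then mel filtering to make the spectrogram more compact...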
2018-08-28 11:25:46 9922 1
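The feature-extraction steps named in the excerpt (pre-emphasis, framing and windowing, Fourier transform, mel filtering) can be sketched roughly as follows. This is a minimal illustration, not the post's own code: it assumes NumPy is available, and the function names and every parameter value (16 kHz sample rate, 25 ms frames, 10 ms step, 0.97 pre-emphasis coefficient, 512-point FFT, 26 filters) are common defaults chosen here for illustration only.

```python
import numpy as np

def hz_to_mel(hz):
    return 2595.0 * np.log10(1.0 + hz / 700.0)

def mel_to_hz(mel):
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters spaced evenly on the mel scale (illustrative helper)."""
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0), n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_points) / sample_rate).astype(int)
    fbank = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(1, n_filters + 1):
        left, center, right = bins[i - 1], bins[i], bins[i + 1]
        for k in range(left, center):      # rising edge of the triangle
            fbank[i - 1, k] = (k - left) / (center - left)
        for k in range(center, right):     # falling edge of the triangle
            fbank[i - 1, k] = (right - k) / (right - center)
    return fbank

def log_mel_features(signal, sample_rate=16000, frame_len=0.025, frame_step=0.010,
                     pre_emphasis=0.97, n_fft=512, n_filters=26):
    """Return an array of shape (n_frames, n_filters) of log mel energies."""
    signal = np.asarray(signal, dtype=float)
    # 1. Pre-emphasis: boost high frequencies with a first-order difference.
    emphasized = np.append(signal[0], signal[1:] - pre_emphasis * signal[:-1])
    # 2. Framing and windowing: overlapping frames, each tapered by a Hamming window.
    flen = int(round(frame_len * sample_rate))
    fstep = int(round(frame_step * sample_rate))
    if len(emphasized) < flen:
        emphasized = np.pad(emphasized, (0, flen - len(emphasized)), mode="constant")
    n_frames = 1 + (len(emphasized) - flen) // fstep
    idx = np.arange(flen)[None, :] + fstep * np.arange(n_frames)[:, None]
    frames = emphasized[idx] * np.hamming(flen)
    # 3. Fourier transform: power spectrum of each (zero-padded) frame.
    power = np.abs(np.fft.rfft(frames, n=n_fft, axis=1)) ** 2 / n_fft
    # 4. Mel filtering: compress the spectrum into n_filters band energies.
    mel_energies = power @ mel_filterbank(n_filters, n_fft, sample_rate).T
    return np.log(mel_energies + 1e-10)    # small offset avoids log(0)
```

Features like these (often followed by a DCT to obtain MFCCs) would then feed the model-training and scoring stages mentioned in the excerpt.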
Determination of the Instants of Glottal Closure from Speech Wave
Determination of the instants of glottal closure (GC) from speech wave using wavelet transform is equivalent to finding a particular local modulus maxima pattern across several scales in the time-scale plane. It is shown that the local modulus maxima of wavelet transform corresponding to the event of GC generally zigzag around the instant of GC and that the amount of the shift of the local modulus maxima from the instant of GC when evolving across scales can be well sketched with an evolution cone of GC maxima. An efficient algorithm that makes use of the evolution cone of GC maxima to detect the instant of GC is presented together with three performance measurements regarding the accuracy of location, false alarm rate, and fail alarm rate of the GC instant detection algorithm. Intensive experiments are carried out to test the validity of this novel method.
2018-10-01
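Pitch Period Extraction from Speech Based on Glottal Closure Instant Estimation
Exploiting the abrupt change of the speech signal at the glottal closure instant (GCI), the method estimates GCIs by locating the local maxima of the wavelet transform of the speech signal that occur near each GCI. The distance between two adjacent GCIs is the pitch period of that stretch of speech, and its reciprocal is the fundamental frequency F0. Experimental results show that the pitch periods detected with this method are accurate, track rapid changes period by period, are robust to noise, and also allow reliable voiced/unvoiced decisions.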
2018-10-01
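The relation stated in the abstract above (pitch period = distance between adjacent GCIs, F0 = its reciprocal) can be illustrated with a short sketch. It assumes the GCIs have already been detected and are given as sample indices; the function name `pitch_from_gci` and the example values are hypothetical, and the wavelet-based GCI detection itself is not shown.

```python
def pitch_from_gci(gci_samples, sample_rate):
    """Return (pitch periods in seconds, F0 values in Hz) from GCI sample indices.

    gci_samples must be strictly increasing; each adjacent pair yields one period.
    """
    periods = []
    for prev, curr in zip(gci_samples, gci_samples[1:]):
        if curr <= prev:
            raise ValueError("GCI positions must be strictly increasing")
        periods.append((curr - prev) / sample_rate)   # period in seconds
    f0 = [1.0 / p for p in periods]                   # F0 is the reciprocal
    return periods, f0

# Hypothetical GCIs every 80 samples at 8 kHz, i.e. a 10 ms pitch period.
periods, f0 = pitch_from_gci([0, 80, 160, 240], sample_rate=8000)
```

Because each period comes from a single pair of adjacent GCIs, the estimate follows changes in pitch one cycle at a time, which matches the abstract's point about tracking rapid variation.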