SAD论文阅读笔记-INTERSPEECH2019

最新推荐文章于 2023-09-04 22:52:43 发布

VIP文章 slowmovingsnail

最新推荐文章于 2023-09-04 22:52:43 发布

阅读量997

点赞数

分类专栏： SAD

本文链接：https://blog.csdn.net/qzhou961/article/details/104896450

版权

Two-Dimensional Convolutional Recurrent Neural Networks for Speech Activity Detection¹

发表于INTERSPEECH2019

研究背景

The 2019 Inaugural Fearless Steps Challenge - Task1: Speech Activity Detection² 链接：link
本文方法在比赛所有27个提交系统中性能排名第一（1/27）：DCF=3.318% (on evaluation dataset)

数据集：the Fearless Steps (FS) Challenge Corpus

comprised of three mission critical stages from the NASA’s Apollo-11 mission, viz., Lift Off, Lunar Landing, and Lunar Walking
30 individual synchronized analog communications channels with multiple speakers in different locations working real-time to accomplish NASA’s Apollo missions
most of the audio channels suffer from a wide range of issues like high channel noise, system noise, attenuated signal bandwidth, transmission noise, cosmic noise, analog tape static noise, noise from tape aging, etc., with noise levels varying within each channel across time

数据集时长：training set ~60h; development set ~20h10min; evaluation set ~20h；采样率：8KHz
NOTE: The training labels provided, are not ground truth; they are system outputs generated by our Baseline Systems.

评价指标：DCF (Detection Cost Function)

$DCF(\theta)=0.75*P_{FN}(\theta)+0.25*P_{FP}(\theta)$

最低0.47元/天解锁文章

slowmovingsnail

关注

0
点赞
踩
3

收藏

觉得还不错? 一键收藏
0
评论
SAD论文阅读笔记-INTERSPEECH2019

Two-Dimensional Convolutional Recurrent Neural Networks for Speech Activity Detection1发表于INTERSPEECH2019研究背景The 2019 Inaugural Fearless Steps Challenge - Task1: Speech Activity Detection2 链接：link...
复制链接

扫一扫