毕设-周报-20150520

上周:
基线的ROC曲线去要重新确定判定标准,发往论文作者处邮件回复如下(版权为论文作者所有):

The curve itself doesn’t matter. I was even suggested by the reviewers to use DET curves as they can better show the results.

For detections, we work on the sentence level. For example, if your sentence contains the keyword, and your decision for detection is YES, then you get a correct detection. We didn’t check the alignment explicitly (i.e., if the boundary of the detected keyword matches the keyword in the sentence exactly), but we only have short sentences, so I guess that’s OK. We make one YES/NO decision for each sentence.

False alarm rate = # of false alarms / # of sentences
False rejection rate = # of false rejections / # of sentences (this is different from the traditional ROC curve)

对下一步大词汇量训练的建议如下:

es I’d suggest to use Librispeech, as we have 1000 hours for that. You’ll have to do forced alignment to generate those time information. If you train your system with Librispeech, I’m sure you’ll have training data alignment somewhere, and you can use that. For evaluation data, you can use your trained model and the reference to do the alignment, and that should be sufficient I think (and that’s what we did for the paper, but we worked on another dataset that is not publicly available).

上次周会至今在进行Librispeech test 训练集上的切词工作(切出模版),尽量选取原作者使用的若干关键词进行实验,同时修改绘制评测曲线的PYTHON脚本。结果整理在下篇推出。

毕设结尾,在尝试使用android转移基线系统的音素识别器做demo,同时继续大词汇量下的训练。
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值