How to run a listening test

Preparation before the listening test


  1.     A test session should last no more than 20-30 min. Longer tests should be divided into two parts with a break in between.
  2.     Use no more than 10 to 15 trials per test session.
  3.     Short excerpts work best: about 10 s is ideal; 20 s is acceptable.
  4.     Match the loudness of all items under test, and play them back at the reference level.
  5.     Subjects must be trained to hear the impairments you are trying to test.
  6.     Hidden reference: The MUSHRA test (BS.1534) uses the original, unprocessed, full-bandwidth programme material as the reference signal, and also presents it as a hidden reference.
  7.     "Anchor" signals: The MUSHRA test (BS.1534) uses two additional "anchor" signals. The standard (low-quality) anchor is a low-pass filtered version of the original signal with a cut-off frequency of 3.5 kHz; the mid-quality anchor has a cut-off frequency of 7 kHz.
    •         Additional anchors are intended to indicate how the systems under test compare to well-known audio quality levels.
    •         They should not be used for re-scaling results between different tests.
    •         These anchors must be known to be detectable to expert listeners but not to inexpert listeners.
    •         These anchors also serve to check the sensitivity of all other aspects of the experimental situation.
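The two anchors described in item 7 can be produced by low-pass filtering the reference. The following is only a minimal sketch using a windowed-sinc FIR filter in plain Python. The filter design (Hamming window, 101 taps), the 48 kHz sample rate, and the helper names `lowpass_fir`, `apply_fir`, `make_anchor`, `tone` and `rms` are assumptions made for illustration; they are not prescribed by BS.1534.

```python
import math

SAMPLE_RATE = 48000  # assumed sample rate for this sketch


def lowpass_fir(cutoff_hz, sample_rate, num_taps=101):
    """Windowed-sinc low-pass FIR taps (Hamming window), normalised to unity gain at DC."""
    fc = cutoff_hz / sample_rate  # normalised cut-off in cycles per sample
    mid = (num_taps - 1) / 2
    taps = []
    for n in range(num_taps):
        x = n - mid
        ideal = 2 * fc if x == 0 else math.sin(2 * math.pi * fc * x) / (math.pi * x)
        window = 0.54 - 0.46 * math.cos(2 * math.pi * n / (num_taps - 1))
        taps.append(ideal * window)
    total = sum(taps)
    return [t / total for t in taps]


def apply_fir(signal, taps):
    """Direct convolution; output has the same length as the input."""
    out = []
    for i in range(len(signal)):
        acc = 0.0
        for k, t in enumerate(taps):
            if i - k >= 0:
                acc += t * signal[i - k]
        out.append(acc)
    return out


def make_anchor(signal, sample_rate, cutoff_hz):
    """Return a low-pass filtered copy of the reference signal."""
    return apply_fir(signal, lowpass_fir(cutoff_hz, sample_rate))


def tone(freq_hz, num_samples=2000, sample_rate=SAMPLE_RATE):
    """Unit-amplitude sine used here as a stand-in for real programme material."""
    return [math.sin(2 * math.pi * freq_hz * n / sample_rate) for n in range(num_samples)]


def rms(signal):
    return math.sqrt(sum(s * s for s in signal) / len(signal))


reference = tone(1000)
low_anchor = make_anchor(reference, SAMPLE_RATE, 3500)  # 3.5 kHz anchor
mid_anchor = make_anchor(reference, SAMPLE_RATE, 7000)  # 7 kHz anchor
```

In practice a DSP library would normally do the filtering, and the resulting anchors would still need to be loudness-matched to the other items as in item 4.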


Rules during the listening test

  • Your scores need to be reproducible: if you are asked to retake the test later, your scores should be approximately the same.
  • Aim for about 1 hour for a whole test, split into sessions with breaks. Listener fatigue can ruin your results.
  • Make sure all loudspeakers work properly.
  • A panel of no more than 20 assessors is often sufficient.

Post-screening of subjects

  • Results that should be discarded
    • Any listener who fails to identify the hidden reference more than 15% of the time must be discarded (BS.1116)
    • Any listener who scores the hidden reference below 90 more than 15% of the time must be discarded. Any listener who scores the 7 kHz anchor above 90 more than 15% of the time must be discarded (BS.1534 (MUSHRA))
  • It must be empirically and statistically shown that any failure to find differences among systems is not due to experimental insensitivity because of poor choices of audio material, or any other weak aspects of the experiment, before a "null" finding can be accepted as valid. It may be necessary to program special trials with low or medium anchors for the explicit purpose of examining subject expertise. (ITU-R BS.1116-3)
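The MUSHRA exclusion rule quoted above (hidden reference below 90, or 7 kHz anchor above 90, in more than 15% of items) is easy to apply mechanically. The sketch below is one possible reading of that rule; the function names `exceeds_share` and `should_exclude` and the input layout (one list of scores per listener) are hypothetical.

```python
def exceeds_share(scores, predicate, max_share=0.15):
    """True if the predicate holds for more than max_share of the scores."""
    if not scores:
        return False
    hits = sum(1 for s in scores if predicate(s))
    return hits / len(scores) > max_share


def should_exclude(reference_scores, mid_anchor_scores, threshold=90, max_share=0.15):
    """Apply the two post-screening checks described in the text to one listener."""
    # Hidden reference rated below the threshold too often
    low_reference = exceeds_share(reference_scores, lambda s: s < threshold, max_share)
    # 7 kHz (mid) anchor rated above the threshold too often
    high_anchor = exceeds_share(mid_anchor_scores, lambda s: s > threshold, max_share)
    return low_reference or high_anchor


# Hypothetical scores for three listeners over ten test items
listener_a = should_exclude([100] * 9 + [80], [40] * 10)
listener_b = should_exclude([100, 85, 80] + [100] * 7, [40] * 10)
listener_c = should_exclude([100] * 10, [95, 92] + [50] * 8)
```

Here listener A misses the threshold on only one item in ten and is kept, while listeners B and C each cross the 15% limit on one of the two checks.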

Criteria for assessors

Discrimination: A measure of the ability to perceive differences between test items.

Reliability: A measure of the closeness of repeated ratings of the same test item.
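Both criteria can be put into numbers once each assessor has rated the same items more than once. The sketch below picks two common statistics as stand-ins: the RMS difference between repeated ratings for reliability, and a one-way ANOVA F ratio across items for discrimination. These choices, and the names `reliability_rms` and `discrimination_f`, are assumptions for illustration; the text above does not name specific metrics.

```python
import math


def reliability_rms(first_pass, second_pass):
    """RMS difference between two ratings of the same items (0 means identical repeats)."""
    squared = [(a - b) ** 2 for a, b in zip(first_pass, second_pass)]
    return math.sqrt(sum(squared) / len(squared))


def discrimination_f(ratings_per_item):
    """One-way ANOVA F ratio: spread between items relative to spread within repeats."""
    k = len(ratings_per_item)
    n_total = sum(len(r) for r in ratings_per_item)
    grand_mean = sum(sum(r) for r in ratings_per_item) / n_total
    means = [sum(r) / len(r) for r in ratings_per_item]
    ss_between = sum(len(r) * (m - grand_mean) ** 2
                     for r, m in zip(ratings_per_item, means))
    ss_within = sum((x - m) ** 2
                    for r, m in zip(ratings_per_item, means) for x in r)
    ms_between = ss_between / (k - 1)
    ms_within = ss_within / (n_total - k)
    # Perfectly repeated ratings give zero within-item variance
    return math.inf if ms_within == 0 else ms_between / ms_within


# Hypothetical assessor: three items, each rated twice
first = [90, 60, 30]
second = [88, 62, 31]
reliability = reliability_rms(first, second)
discrimination = discrimination_f([list(pair) for pair in zip(first, second)])
```

A small `reliability` value means the repeated ratings are close together, and a large `discrimination` value means the assessor separates the items clearly relative to their own rating noise.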

Standard Documents

Standard | Name | Page
Recommendation ITU-R BS.1116 | Methods for the subjective assessment of small impairments in audio systems | http://www.itu.int/rec/R-REC-BS.1116/en
Recommendation ITU-R BS.1534-3 (10/2015) | Method for the subjective assessment of intermediate quality level of audio systems | http://www.itu.int/rec/R-REC-BS.1534/en
