hydrogenaudio 编解码标准测试结果

http://listening-tests.hydrogenaudio.org/igorc/results.html

Results of the public multiformat listening test @ 64 kbps (March/April 2011)


These are the summary results of the multiformat listening test @ 64 kbps.

You can download a ZIP file containing all results for all samples.

Encryption keys can be downloaded from here and here.

How to interpret the plots: Each plot is drawn with 5 codecs on the X axis and the rating given (1.0 to 5.0) on the Y axis. The 95% confidence intervals are given on each plot. The mean rating given to each codec is indicated by the middle point of each vertical line segment. Each vertical line segment represents the 95% confidence interval (using ANOVA analysis) for each codec. This analysis is identical to the one used in previous listening tests.

One codec can be said to be better than another with greater than 95% confidence if the bottom of its segment is at or above the top of the competing codec's line segment. Note that this is an approximate analysis with some assumptions, and the confidence is far greater in most cases. An almost assumption-free analysis (bootstrap) is below.

Note that CELT is referred to as Opus as this will be the standardized name.

Important note: These plots represent group preferences (for the particular group of people who participated in the test). Individual preferences vary somewhat. The best codec for a person is dependent on his own preferences and the type of music he prefers.


Plot of the complete result (30 samples, 531 results):

Full plot

Closeup of the interesting results (30 samples, 531 results):

Zoomed plot

Per-sample results

A page with graphics for each sample individually is here.

Bitrate table

The codecs and settings were calibrated to provide ~64kbps on a large variety of music.

These are the bitrates used by the codes for the samples in the test:

Codec bitrates

Bootstrap analysis:

 

	Read 5 treatments, 531 samples => 10 comparisons
	Means:
	      Vorbis   Nero_HE-AAC  Apple_HE-AAC          Opus    AAC-LC@48k 
	       3.513         3.547         3.817         3.999         1.656 

	Unadjusted p-values:
		  Nero_HE-AAC   Apple_HE-AAC  Opus          AAC-LC@48k   
	Vorbis        0.488         0.000*        0.000*        0.000*       
	Nero_HE-AAC   -             0.000*        0.000*        0.000*       
	Apple_HE-AAC  -             -             0.000*        0.000*       
	Opus          -             -             -             0.000*       

	Apple_HE-AAC is better than Vorbis (p=0.000)
	Apple_HE-AAC is better than Nero_HE-AAC (p=0.000)
	Opus is better than Vorbis (p=0.000)
	Opus is better than Nero_HE-AAC (p=0.000)
	Opus is better than Apple_HE-AAC (p=0.000)
	AAC-LC@48k is worse than Vorbis (p=0.000)
	AAC-LC@48k is worse than Nero_HE-AAC (p=0.000)
	AAC-LC@48k is worse than Apple_HE-AAC (p=0.000)
	AAC-LC@48k is worse than Opus (p=0.000)

	p-values adjusted for multiple comparison:
		  Nero_HE-AAC   Apple_HE-AAC  Opus          AAC-LC@48k   
	Vorbis        0.490         0.000*        0.000*        0.000*       
	Nero_HE-AAC   -             0.000*        0.000*        0.000*       
	Apple_HE-AAC  -             -             0.000*        0.000*       
	Opus          -             -             -             0.000*       

	Apple_HE-AAC is better than Vorbis (p=0.000)
	Apple_HE-AAC is better than Nero_HE-AAC (p=0.000)
	Opus is better than Vorbis (p=0.000)
	Opus is better than Nero_HE-AAC (p=0.000)
	Opus is better than Apple_HE-AAC (p=0.000)
	AAC-LC@48k is worse than Vorbis (p=0.000)
	AAC-LC@48k is worse than Nero_HE-AAC (p=0.000)
	AAC-LC@48k is worse than Apple_HE-AAC (p=0.000)
	AAC-LC@48k is worse than Opus (p=0.000)    
    

 

ANOVA analysis:

 

        FRIEDMAN version 1.24 (Jan 17, 2002) http://ff123.net/
	Blocked ANOVA analysis

	Number of listeners: 531
	Critical significance:  0.05
	Significance of data: 0.00E+00 (highly significant)
	---------------------------------------------------------------
	ANOVA Table for Randomized Block Designs Using Ratings

	Source of         Degrees     Sum of    Mean
	variation         of Freedom  squares   Square    F      p

	Total             2654        4521.67
	Testers (blocks)   530        1498.18
	Codecs eval'd        4        1893.65  473.41   888.29  0.00E+00
	Error             2120        1129.85    0.53
	---------------------------------------------------------------
	Fisher's protected LSD for ANOVA:   0.088

	Means:

	Opus     Apple_HE Nero_HE- Vorbis   AAC-LC@4 
	  4.00     3.82     3.55     3.51     1.66   

	---------------------------- p-value Matrix ---------------------------

		 Apple_HE Nero_HE- Vorbis   AAC-LC@4 
	Opus     0.000*   0.000*   0.000*   0.000*   
	Apple_HE          0.000*   0.000*   0.000*   
	Nero_HE-                   0.439    0.000*   
	Vorbis                              0.000*   
	-----------------------------------------------------------------------

	Opus is better than Apple_HE-AAC, Nero_HE-AAC, Vorbis, AAC-LC@48k
	Apple_HE-AAC is better than Nero_HE-AAC, Vorbis, AAC-LC@48k
	Nero_HE-AAC is better than AAC-LC@48k
	Vorbis is better than AAC-LC@48k
    

Notes:

The graphs are a simple ANOVA analysis over all submitted and valid results. This is compatible with the graphs of previous listening tests, but should only be considered as a visual support for the real analysis.

For a correct calculation of the statistical probability, and to see if one can safely make any conclusions, one has to refer to the bootstrap output. You can see that the results are highly significant for all but one comparison (Vorbis vs. Nero HE-AAC).

When crosschecking against the old friedman/ANOVA utility one can see that the results are almost identical, which is expected as there are many results and the codecs were mostly not transparent, so the results are reasonably normally distributed in this test.

A potential issue is that not every sample was tested by the same amount of listeners, and that notably, the first few samples got more submissions than later ones. Preliminary analsysis (by only including the listeners that tested all samples) shows that this makes no difference to the conclusions.

Post-screening:

Invalid results were discarded according to the following criteria, which were made public at the beginning of the test:

  • If the listener ranked the reference worse than 4.5 on a sample, the listener's results for that sample were discarded.
  • If the listener ranked the low anchor at 5.0 on a sample, the listener's results for that sample were discarded.
  • If the listener ranked the reference below 5.0 on more than 4 samples, all of that listener's results were discarded.

 

Contact

IgorC: igoruso@gmail.com

转载于:https://www.cnblogs.com/gaozehua/archive/2012/04/13/2446332.html

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
毕业设计,基于SpringBoot+Vue+MySQL开发的影城管理系统,源码+数据库+论文答辩+毕业论文+视频演示 随着现在网络的快速发展,网上管理系统也逐渐快速发展起来,网上管理模式很快融入到了许多生活之中,随之就产生了“小徐影城管理系统”,这样就让小徐影城管理系统更加方便简单。 对于本小徐影城管理系统的设计来说,系统开发主要是采用java语言技术,在整个系统的设计中应用MySQL数据库来完成数据存储,具体根据小徐影城管理系统的现状来进行开发的,具体根据现实的需求来实现小徐影城管理系统网络化的管理,各类信息有序地进行存储,进入小徐影城管理系统页面之后,方可开始操作主控界面,主要功能包括管理员:首页、个人中心、用户管理、电影类型管理、放映厅管理、电影信息管理、购票统计管理、系统管理、订单管理,用户前台;首页、电影信息、电影资讯、个人中心、后台管理、在线客服等功能。 本论文主要讲述了小徐影城管理系统开发背景,该系统它主要是对需求分析和功能需求做了介绍,并且对系统做了详细的测试和总结。具体从业务流程、数据库设计和系统结构等多方面的问题。望能利用先进的计算机技术和网络技术来改变目前的小徐影城管理系统状况,提高管理效率。 关键词:小徐影城管理系统;Spring Boot框架,MySQL数据库
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值