几种C++编译器的性能比较

最新推荐文章于 2024-08-09 03:05:54 发布

jarodpku

最新推荐文章于 2024-08-09 03:05:54 发布

阅读量3.8k

点赞数 1

分类专栏： C++ 文章标签：编译器 c++ 优化 gcc borland 测试

本文链接：https://blog.csdn.net/jarodpku/article/details/3327711

版权

C++ 专栏收录该内容

13 篇文章 0 订阅

订阅专栏

现在市面上，主流的C/C++编译器包括M$的CL、gcc、Intel的icl、PGI的pgcc及Codegear的bcc（原来属于Borland公司）。Windows上使用最多的自然是cl，而在更广阔的平台上，gcc则是C/C++编译器的首选。但要提到能力优化，排名就未必与它们的市场占有率一致了。

今天一时兴起，便做了一个各编译器数值性能的比较。测试的代码是一个求积分的程序，来源于intel编译器的例子程序，修改了一个头文件，以便每个编译器都能编译。

 
 #include <stdio.h> 
#include <stdlib.h>  
#include <time.h>  
#include <math.h> 
// Function to be integrated 
// Define and prototype it here 
// | sin(x) | 
#define INTEG_FUNC(x)  fabs(sin(x)) 
// Prototype timing function 
double dclock(void);
int main(void)
{
   // Loop counters and number of interior points 
   unsigned int i, j, N;
   // Stepsize, independent variable x, and accumulated sum 
   double step, x_i, sum;
   // Timing variables for evaluation    
   double start, finish, duration, clock_t;
   // Start integral from  
   double interval_begin = 0.0;
   // Complete integral at  
   double interval_end = 2.0 * 3.141592653589793238;
   // Start timing for the entire application 
   start = clock();
   printf("     /n");
   printf("    Number of    | Computed Integral | /n");
   printf(" Interior Points |                   | /n");
   for (j=2;j<27;j++)
   {
    printf("------------------------------------- /n");
     // Compute the number of (internal rectangles + 1) 
     N =  1 << j;
     // Compute stepsize for N-1 internal rectangles  
     step = (interval_end - interval_begin) / N;
     // Approx. 1/2 area in first rectangle: f(x0) * [step/2]  
     sum = INTEG_FUNC(interval_begin) * step / 2.0;
     // Apply midpoint rule: 
     // Given length = f(x), compute the area of the 
     // rectangle of width step 
     // Sum areas of internal rectangle: f(xi + step) * step  
     for (i=1;i<N;i++)
     {
        x_i = i * step;
        sum += INTEG_FUNC(x_i) * step;
     }
     // Approx. 1/2 area in last rectangle: f(xN) * [step/2]  
     sum += INTEG_FUNC(interval_end) * step / 2.0;
     printf(" %10d      |  %14e   | /n", N, sum);
   }
   finish = clock();
   duration = (finish - start);
   printf("     /n");
   printf("   Application Clocks   = %10e  /n", duration);
   printf("     /n");
   return 0;
}
 

当然，这个代码来自于intel，当然非常适合intel的编译器。以下的测试在Intel Core 2 Duo上进行。

	gcc (GCC TDM-2 for MinGW) 4.3.0	VC 9.0 (cl 15.00.21022.08)	Intel (icl 10.1)	PGI (pgcc 7.16)	CodeGear (bcc32 6.10)
禁止优化
	-O0	/Od	-Od	-O0	-Od
	17161	14461	12441	10514	13400
	17133	14430	11687	9956	12917
	17155	14476	11871	10099	13026
编译选项 -O2
	13011	7737	4540	9348	12636
	16571	7706	4185	9148	13026
	16573	7706	4042	9183	13057
针对平台的优化
	-march=core2 -O2	/arch:SSE2 /O2	-QxT	-tp core2 -O2	无
	16060	7710	1938	9578

测试的结果说明，在数值计算方法，intel的编译器是非常利害的，特别是针对某CPU的优化，能提高很多性能。GCC表现却有些让人失望。在禁止优化到-O2级优化的对比中，可以看出intel与m$的编译器的优化效果是非常明显的，而其它编译器优化后的提高非常有限。如果给个排名，那么将是 icl>cl>pgcc>bcc>gcc。

另外，在一台P4 1.5G的机器，linux环境下，测试得到