大语言模型排行榜（2024年9月）

最新推荐文章于 2025-04-15 16:36:45 发布

陈小目

最新推荐文章于 2025-04-15 16:36:45 发布

阅读量9.9k

点赞数 44

文章标签：语言模型人工智能自然语言处理

本文链接：https://blog.csdn.net/CHENNNNNNNNNNN/article/details/141856342

版权

来源：

SuperCLUE总排行榜（2024年9月）：https://www.superclueai.com/

排名（开源+闭源）

排名	模型	机构	总分	理科得分	文科得分	Hard得分	使用方式
1	ChatGPT-4o-latest	OpenAI	79.67	81.5	78.62	78.87	API
2	Hunyuan-Turbo-Preview	腾讯	78.64	82.73	78.86	74.32	API
3	GPT-4-Turbo-2024-04-09	OpenAI	76.7	79.62	76.77	73.7	API
4	AndesGPT-2.0	OPPO	76.24	81.22	77.75	69.75	API
5	DeepSeek-V2-0628	深度求索	74.63	79.63	77.66	66.6	API
6	DeepSeek-Coder-V2-0724	深度求索	74.01	80.19	76.96	64.87	API
7	Qwen2-72B-Instruct	阿里巴巴	73.51	76.6	76.76	67.15	API
8	SenseChat 5.5	商汤	73.51	77.54	77.47	65.51	API
9	Claude 3.5 Sonnet	Anthropic	73.39	76.67	71.36	72.15	API
10	Gemini-1.5-Pro	Google	72.87	76.28	77.63	64.71	API
11	GPT-4o-mini	OpenAI	72.81	76.29	75.2	66.93	API
12	Doubao_pro_preview	字节跳动	72.03	75.98	75.75	64.37	API
13	GLM-4-0520	清华&智谱AI	70.99	73.39	74.29	65.28	API
14	Mistral-Large-Instruct-2407	Mistral AI	70.62	73.71	71.19	66.98	POE
15	山海大模型4.0	云知声	70.52	75.61	76.63	59.32	API
16	ERNIE-4.0-Turbo-8K	百度	70.13	76.23	74.8	59.36	API
17	Baichuan4	百川智能	69.03	73.47	75.04	58.58	API
18	MiniMax-abab6.5s	稀宇科技	68.84	69.96	76.69	59.87	API
19	Yi-Large	零一万物	68.23	72.6	74.45	57.64	API
20	360gpt2-pro	360	67.75	71.04	75.83	56.38	API
21	从容大模型1.5	云从科技	67.74	72.8	76.16	54.25	API
22	Qwen-Max	阿里巴巴	67.73	72.86	77.18	53.16	API
23	GPT-4-0613	OpenAI	67.08	71.52	70.47	59.25	API
24	Llama-3.1-405B-Instruct	Meta	66.5	73.7	68.89	56.92	POE
25	讯飞星火V4.0	科大讯飞	66.06	70.32	70	57.86	API
26	Step-1-32k	阶跃星辰	65.72	69.73	73.57	53.86	API
27	Moonshot(kimi)	月之暗面	65.31	66.8	75.22	53.92	网页
28	Llama-3.1-70B-Instruct	Meta	63.72	67.39	68.18	55.6	POE
29	Yi-1.5-34B-Chat-16K	零一万物	62.02	66.07	73.13	46.87	模型
30	GLM-4-9B-Chat	清华&智谱AI	61.15	66.5	71.08	45.88	模型
31	Gemma-2-9b-it	Google	60.93	63.41	72.54	46.85	模型
32	Qwen2-7B-Instruct	阿里巴巴	58.75	63.97	74.36	37.94	模型
33	XVERSE-65B-2-32K	元象科技	56.42	56.82	73.98	38.48	API
34	Yi-1.5-9B-Chat-16K	零一万物	55.2	56.01	69.81	39.79	模型
35	Llama-3.1-8B-Instruct	Meta	53.28	58.53	62.91	38.41	POE
36	Yi-1.5-6B-Chat	零一万物	51.93	56.6	64.22	34.97	模型
37	Gemma-2-2b-it	Google	48.93	48.3	64.64	33.84	模型
38	Phi-3-Mini-4K-Instruct	微软	42.67	48.91	49.69	29.43	模型
39	Mistral-7B-Instruct-v0.3	Mistral AI	39.05	39.45	54.19	23.5	模型
40	Qwen2-1.5B-Instruct	阿里巴巴	38.96	37.42	61.61	17.85	模型
41	Baichuan2-7B-Chat	百川智能	37.57	28.75	63.94	20.01	模型
42	RWKV-6-World-7B	RWKV开源基金会	33.45	28.24	56.63	15.49	模型
43	Gemma-2b-it	Google	29.74	24.73	48.74	15.74	模型

排名（开源）

排名	模型	机构	总分	理科得分	文科得分	Hard得分	参数量	使用方法
1	DeepSeek-V2-0628	深度求索	74.63	79.63	77.66	66.6	2360亿	API
2	DeepSeek-Coder-V2-0724	深度求索	74.01	80.19	76.96	64.87	2360亿	API
3	Qwen2-72B-Instruct	阿里巴巴	73.51	76.6	76.76	67.15	720亿	API
4	Mistral-Large-Instruct-2407	Mistral AI	70.62	73.71	71.19	66.98	1230亿	POE
5	Llama-3.1-405B-Instruct	Meta	66.5	73.7	68.89	56.92	4050亿	POE
6	Llama-3.1-70B-Instruct	Meta	63.72	67.39	68.18	55.6	700亿	POE
7	Yi-1.5-34B-Chat-16K	零一万物	62.02	66.07	73.13	46.87	340亿	模型
8	GLM-4-9B-Chat	清华&智谱AI	61.15	66.5	71.08	45.88	90亿	模型
9	Gemma-2-9b-it	Google	60.93	63.41	72.54	46.85	90亿	模型
10	Qwen2-7B-Instruct	阿里巴巴	58.75	63.97	74.36	37.94	70亿	模型
11	XVERSE-65B-2-32K	元象科技	56.42	56.82	73.98	38.48	650亿	API
12	Yi-1.5-9B-Chat-16K	零一万物	55.2	56.01	69.81	39.79	90亿	模型
13	Llama-3.1-8B-Instruct	Meta	53.28	58.53	62.91	38.41	80亿	POE
14	Yi-1.5-6B-Chat	零一万物	51.93	56.6	64.22	34.97	60亿	模型
15	Gemma-2-2b-it	Google	48.93	48.3	64.64	33.84	20亿	模型
16	Phi-3-Mini-4K-Instruct	微软	42.67	48.91	49.69	29.43	38亿	模型
17	Mistral-7B-Instruct-v0.3	Mistral AI	39.05	39.45	54.19	23.5	70亿	模型
18	Qwen2-1.5B-Instruct	阿里巴巴	38.96	37.42	61.61	17.85	15亿	模型
19	Baichuan2-7B-Chat	百川智能	37.57	28.75	63.94	20.01	70亿	模型
20	RWKV-6-World-7B	RWKV开源基金会	33.45	28.24	56.63	15.49	70亿	模型
21	Gemma-2b-it	Google	29.74	24.73	48.74	15.74	20亿	模型