Llama vs. Qwen vs. DeepSeek, the Open-Source Contest: CLiB Open-Source LLM Leaderboard 03.05, Medical Domain

easyllm

Published 2025-03-28 08:45:00


Column: LLM Evaluation [Open-Source Edition]. Tags: llama, open source, DeepSeek, AI LLM evaluation, AI LLM technology, medical LLMs, AI applications in healthcare

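Copyright notice: This is an original article by the blogger, released under the CC 4.0 BY-SA license. When reposting, please include a link to the original source and this notice.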

Article link: https://blog.csdn.net/easyllm/article/details/146514523


For the overall capabilities of open-source models, see: Llama vs. Qwen vs. DeepSeek, the Open-Source Contest: CLiB Open-Source LLM Leaderboard 03.04.

The medical-domain leaderboard follows.

Output prices are in CNY per million tokens (元/M tok).

Rank	Model	Organization	Output price (CNY/M tok)	Medical score
1	DeepSeek-R1	DeepSeek	16	85.5
2	qwq-32b-preview	Alibaba	7	78.7
3	qwen2.5-72b-instruct	Alibaba	12	77.74
4	Meta-Llama-3.1-405B-Instruct	Meta	21	77.34
5	deepseek-chat-v3	DeepSeek	8	76.06
6	qwen2.5-32b-instruct	Alibaba	7	74.64
7	DeepSeek-R1-Distill-Qwen-32B	DeepSeek	1.3	72.83
8	qwen2.5-14b-instruct	Alibaba	6	72.43
9	internlm2_5-20b-chat	Shanghai AI Laboratory	1	71.87
10	internlm2_5-7b-chat	Shanghai AI Laboratory	0.4	71.12
11	Llama-3.1-Nemotron-70B-Instruct-fp8	NVIDIA	2.2	69.27
12	qwen2.5-7b-instruct	Alibaba	2	68.52
13	Llama-3.3-70B-Instruct	Meta	4.1	68.39
14	Llama-3.3-70B-Instruct-fp8	Meta	2.2	67.88
15	DeepSeek-R1-Distill-Qwen-14B	DeepSeek	0.7	67.2
16	DeepSeek-R1-Distill-Llama-70B	DeepSeek	4.1	62.27
17	glm-4-9b-chat	Zhipu AI	0.6	61.94
18	Hermes-3-Llama-3.1-405B	NousResearch	5.8	58.78
19	qwen2.5-3b-instruct	Alibaba	0	54.09
20	phi-4	Microsoft	1	47.81
21	qwen2.5-1.5b-instruct	Alibaba	0	47.28
22	Llama-3.1-8B-Instruct	Meta	0.4	46.9
23	gemma-2-27b-it	Google	1.3	46.47
24	gemma-2-9b-it	Google	0.6	45.66
25	Meta-Llama-3.1-8B-Instruct-fp8	Meta	0.4	44.63
26	Llama-3.2-3B-Instruct	Meta	0.2	41.81
27	Mistral-Nemo-Instruct-2407	Mistral	0.6	40.14
28	qwen2.5-0.5b-instruct	Alibaba	0	33.59
29	DeepSeek-R1-Distill-Llama-8B	DeepSeek	0.4	32.73
30	DeepSeek-R1-Distill-Qwen-7B	DeepSeek	0.4	31.27
31	Mistral-7B-Instruct-v0.3	Mistral	0.4	29.59
32	Llama-3.2-1B-Instruct	Meta	0.2	28.91
33	DeepSeek-R1-Distill-Qwen-1.5B	DeepSeek	0.1	27.46
34	qwen2.5-math-72b-instruct	Alibaba	12	/
35	Yi-1.5-34B-Chat	01.AI	1.3	/
36	Yi-1.5-9B-Chat	01.AI	0.4	/
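
To make the price unit concrete, here is a minimal Python sketch that turns an output price quoted in CNY per million tokens into an estimated cost for a given number of generated tokens. The three prices are copied from the table above; the function name `output_cost_cny` and the 250,000-token workload are illustrative assumptions, not part of the CLiB benchmark, and the sketch ignores input-token pricing, which the table does not list.

```python
# Output prices in CNY per million tokens, copied from the leaderboard table.
OUTPUT_PRICE_CNY_PER_M_TOK = {
    "DeepSeek-R1": 16.0,
    "qwq-32b-preview": 7.0,
    "internlm2_5-7b-chat": 0.4,
}


def output_cost_cny(model: str, output_tokens: int) -> float:
    """Estimate the output cost in CNY for `output_tokens` generated tokens.

    Hypothetical helper: cost = price per million tokens * tokens / 1,000,000.
    """
    price_per_million = OUTPUT_PRICE_CNY_PER_M_TOK[model]
    return price_per_million * output_tokens / 1_000_000


if __name__ == "__main__":
    # Assumed workload: 250,000 output tokens per model.
    for name in OUTPUT_PRICE_CNY_PER_M_TOK:
        print(f"{name}: {output_cost_cny(name, 250_000):.2f} CNY")
```

Under these assumptions, a quarter of a million output tokens from DeepSeek-R1 costs about 4 CNY, versus about 0.1 CNY from internlm2_5-7b-chat.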

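The medical domain currently covers three dimensions, all drawn from physician examinations: the residency (standardized training) completion exam, the licensed assistant physician exam, and the licensed physician exam. The residency completion exam spans 18 specialties such as surgery and dermatology; the licensed assistant physician exam spans 5 tracks such as clinical and dental assistant physician; the licensed physician exam spans 5 tracks such as integrated Chinese and Western medicine and public health.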

Full evaluation results: https://github.com/jeinlee1991/chinese-llm-benchmark

About the LLM evaluation platform EasyLLM: https://easyllm.site

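Most comprehensive: billed as the most comprehensive LLM product evaluation platform worldwide, already covering about 200 models
Most up to date: daily updates of capability metrics for each model, published as leaderboards
Most convenient: no registration or VPN required; models from China and abroad can be evaluated with one click
Transparent results: the methods, question sets, process, and scores of every evaluation are visible and traceable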
