常见GPU算力比较(历代游戏卡皇)

一、硬件参数

2080Ti30903090Ti40904090D50905090D
核心TU102-300AGA102-300GA102-350AD102-300AD102-250GB202-300GB202-250
架构TuringAmpereAmpereAda LovelaceAda LovelaceBlackwellBlackwell
SM688284128114170170
CUDA Cores / SM64128128128128128128
CUDA Cores / GPU4352104961075216384145922176021760
Tensor Core2nd3rd3rd4th4th5th5th
Tensor Cores / SM8444444
Tensor Cores / GPU544328336512456680680
GPU 加速频率 (MHz)1545169518602520252024072407
显存11 / 22 GB (GDDR6)*24 GB (GDDR6X)24 GB (GDDR6X)24 GB (GDDR6X)24 GB (GDDR6X)32 GB (GDDR7)32 GB (GDDR7)
显存位宽 (bit)352384384384384512512
显存速率 (Gbps)1419.52121212828
显存带宽 (GBps)616936.210081008100817921792
一缓 (KB per SM)64128128128128128128
二缓 (MB)66672729696
TGP (W)250350450450425575575
制程TSMC 12nm FFNSamsung 8N (8nm)Samsung 8N (8nm)TSMC 4N (5nm)TSMC 4N (5nm)TSMC 4N (5nm)TSMC 4N (5nm)

* 22 GB 是常见的手动扩显存的魔改卡

二、算力

1、CUDA Core 算力

浮点:TFLOPS

整型:TIOPS

取 4090 的算力为100%

2080Ti30903090Ti40904090D50905090D
FP3213.4535.5840.0082.673.5104.8104.8
FP1626.935.5840.0082.673.5104.8104.8
FP640.42020.5560.6251.291.1491.641.64
BF16NA35.5840.0082.673.5104.8104.8
INT3213.4517.7920.0041.336.8104.8104.8
2080Ti30903090Ti40904090D50905090D
FP3216.3%43.1%48.4%100%89.0%126.9%126.9%
FP1632.6%43.1%48.4%100%89.0%126.9%126.9%
FP6432.6%43.1%48.4%100%89.0%126.9%126.9%
BF16NA43.1%48.4%100%89.0%126.9%126.9%
INT3232.6%43.1%48.4%100%89.0%253.6%253.6%

2、Tensor Core 算力

浮点:TFLOPS

整型:TIOPS

稠密/稀疏

取 4090 的算力为100%

2080Ti30903090Ti40904090D50905090D*
FP4NANANANANA1676 / 3352NA / 2375
FP8NANANA660.6 / 1321.2588.4 / 1176.8838 / 1676NA / NA
FP16107.6142 / 284160 / 320330.3 / 660.6294.2 / 588.4419 / 838NA / NA
BF16NA71 / 14280 / 160165.2 / 330.4147.1 / 294.2209.5 / 419NA / NA
TF32NA35.6 / 7140 / 8082.6 / 165.273.5 / 147.1104.8 / 209.5NA / NA
INT8215.2284 / 568320 / 640660.6 / 1321.2588.4 / 1176.8838 / 1676NA / NA
INT4430.3568 / 1136640 / 12801321.2 / 2642.41176.8 / 2353.61676 / 3352NA / NA
2080Ti30903090Ti40904090D50905090D*
FP4NANANANANANANA
FP8NANANA100%89.0%126.9%NA
FP1632.6%43.1%48.4%100%89.0%126.9%NA
BF16NA43.1%48.4%100%89.0%126.9%NA
TF32NA43.1%48.4%100%89.0%126.9%NA
INT832.6%43.1%48.4%100%89.0%126.9%NA
INT432.6%43.1%48.4%100%89.0%126.9%NA

*5090D 的 Tensor Core 算力有待考证

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值