PyTorch RuntimeError: CUDA out of memory. Tried to allocate 120.00 MiB (GPU 0; 22.38 GiB total capacity

Latest recommended article published 2024-04-24 16:26:12

凝眸伏笔


Category: pytorch. Tags: deep learning, cuda

Article link: https://blog.csdn.net/pearl8899/article/details/109540573


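The problem

Environment: pytorch=1.+, python=3.6, 1 GPU

When fine-tuning a pretrained BERT model with the training batch size set to 256, the run fails with the error below. Roughly, the GPU ran out of memory: of the 22 GiB total (the model structure, parameters, and so on take up 5 GiB), more than 16 GiB had already been allocated, and the remaining 78 MiB was not enough to satisfy a 120 MiB request.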

RuntimeError: CUDA out of memory. Tried to allocate 120.00 MiB (GPU 0; 22.38 GiB total capacity; 
16.74 GiB already allocated; 78.06 MiB free; 16.85 GiB reserved in total by PyTorch)

Solutions

1. Reduce the batch size.

As an example, let's work out the relationship between batch size and memory usage:
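A rough way to reason about this relationship is a linear model: a fixed cost (weights, optimizer state, and so on) plus a per-sample cost multiplied by the batch size. The sketch below uses the 22.38 GiB capacity and the 5 GiB fixed cost quoted in this post. The probe measurement (9 GiB peak at batch size 32) is a made-up number for illustration only, and the function names are hypothetical. Real usage also includes allocator overhead and fragmentation, so treat the result as an upper bound and leave some headroom.

```python
def per_sample_gib(peak_gib, batch_size, fixed_gib):
    """Estimate memory per sample from one measured run (linear model)."""
    return (peak_gib - fixed_gib) / batch_size


def max_batch_size(total_gib, fixed_gib, sample_gib):
    """Largest batch size whose estimated memory fits in total_gib."""
    return int((total_gib - fixed_gib) // sample_gib)


# Figures from the post: 22.38 GiB capacity, 5 GiB for the model itself.
TOTAL_GIB = 22.38
FIXED_GIB = 5.0

# Hypothetical probe run: 9 GiB peak at batch size 32 (illustrative only).
sample_cost = per_sample_gib(peak_gib=9.0, batch_size=32, fixed_gib=FIXED_GIB)
limit = max_batch_size(TOTAL_GIB, FIXED_GIB, sample_cost)

print(f"estimated cost per sample: {sample_cost} GiB")  # 0.125
print(f"estimated largest batch size: {limit}")         # 139
```

Under these assumed numbers a batch size of 256 would need about 5 + 256 × 0.125 = 37 GiB, well beyond the card's capacity, which is the kind of gap the error message reports.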

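2. Increase the batch size "in disguise", that is, raise the effective batch size without fitting the whole batch in memory at once.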
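The post does not spell out which technique it has in mind here. One common approach is gradient accumulation: run several small micro-batches, let their gradients add up, and call the optimizer once. The sketch below is an illustration under that assumption, using a toy linear model and random data on the CPU (all hypothetical stand-ins, not the author's BERT setup). It also checks that the accumulated gradient matches the gradient of one full batch.

```python
import copy

import torch
from torch import nn

torch.manual_seed(0)

# Toy stand-ins (hypothetical): a tiny model and random data on the CPU.
model = nn.Linear(8, 2)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
loss_fn = nn.CrossEntropyLoss()  # averages over the batch by default

inputs = torch.randn(32, 8)
labels = torch.randint(0, 2, (32,))

# Reference: gradient of a single full batch of 32 on an identical copy.
reference = copy.deepcopy(model)
loss_fn(reference(inputs), labels).backward()

micro_batch_size = 8     # what fits in memory at once
accumulation_steps = 4   # 8 * 4 = effective batch size of 32

optimizer.zero_grad()
for step in range(accumulation_steps):
    start = step * micro_batch_size
    x = inputs[start:start + micro_batch_size]
    y = labels[start:start + micro_batch_size]
    # Scale each loss so the summed gradients equal the full-batch mean.
    loss = loss_fn(model(x), y) / accumulation_steps
    loss.backward()      # gradients accumulate in each parameter's .grad

accumulated_grad = model.weight.grad.clone()
optimizer.step()         # one update for the whole effective batch
optimizer.zero_grad()

print(torch.allclose(accumulated_grad, reference.weight.grad, atol=1e-6))
```

Only one micro-batch of activations is alive at a time, so peak memory follows `micro_batch_size` rather than the effective batch size. Layers that compute statistics per batch, such as batch normalization, would still see only the micro-batch.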

Addendum:

Observed during training: limited by memory, each GPU can take a batch size of at most 128. The memory usage at that setting is worth checking.
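One way to check memory usage from inside the training script is to query PyTorch's CUDA allocator. The helper below is a minimal sketch; `gpu_memory_report` is a hypothetical name, and it assumes a PyTorch version that provides `torch.cuda.memory_reserved` (older releases exposed this under a different name). The three main fields correspond to the "already allocated", "reserved in total by PyTorch", and "total capacity" figures in the error message.

```python
import torch


def gpu_memory_report(device_index=0):
    """Return memory figures in GiB for one GPU, or None without CUDA."""
    if not torch.cuda.is_available():
        return None
    gib = 1024 ** 3
    props = torch.cuda.get_device_properties(device_index)
    return {
        # Memory currently occupied by live tensors.
        "allocated_gib": torch.cuda.memory_allocated(device_index) / gib,
        # Memory held by PyTorch's caching allocator (allocated + cached).
        "reserved_gib": torch.cuda.memory_reserved(device_index) / gib,
        # Highest allocated value seen so far in this process.
        "peak_allocated_gib": torch.cuda.max_memory_allocated(device_index) / gib,
        # Physical capacity of the device.
        "total_gib": props.total_memory / gib,
    }


report = gpu_memory_report()
if report is None:
    print("CUDA is not available on this machine")
else:
    for name, value in report.items():
        print(f"{name}: {value:.2f}")
```

Calling it after a few training steps at different batch sizes gives the measurements needed to judge how much room is left before the next out-of-memory error.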

References:

1. Reducing the batch size: https://github.com/pytorch/pytorch/issues/16417

2. Training trick: https://github.com/huggingface/transformers/issues/906
