Pytorch 2.0.1内存泄漏问题

nhjydywd0

已于 2024-02-02 18:35:57 修改

阅读量254

点赞数 1

文章标签： pytorch 人工智能 python

于 2024-02-02 18:34:30 首次发布

本文链接：https://blog.csdn.net/nhjydywd0/article/details/135994508

版权

文章描述了PyTorch2.0.1版本在处理包含大量不同shape数据时出现的内存泄漏问题，尤其是在推理阶段。作者怀疑是由于库对每个shape的tensor缓存导致内存占用过多，且暂无官方清理机制。开发者呼吁解决这一问题并希望得到社区的帮助。

摘要由CSDN通过智能技术生成

Pytorch 2.0.1内存泄漏问题

更新：经过多次测试，已经确信这个BUG只在2.0.x版本中存在，避免使用2.0.x版本即可。此贴完结。

当推理的数据含有大量不同的shape时，会导致内存泄漏。一段发生泄露的代码：

from torchvision.models import resnet
import torch
from memory_profiler import profile

net = resnet.resnet50(pretrained=True)
net = net.cuda()
net.train()

@profile(precision=4,stream=open('resnet.log','w'))
def infer(width, height):
    data = torch.randn(2, 3, width, height)
    x = data.clone().cuda()
    out = net(x)
    torch.cuda.empty_cache()

for width in range(100, 2000, 10):
    print(width)
    for height in range(100, 2000 ,10):
        infer(width, height)

用memory_profiler打印一下，内存占用直线上升：
在这里插入图片描述