解决RuntimeError: one of the variables needed for gradient computation has been modified by an inplace

shakebalabala

已于 2024-01-31 16:01:46 修改

阅读量1.2k

点赞数 13

文章标签： python 运维深度学习 pytorch

于 2024-01-02 17:12:18 首次发布

本文链接：https://blog.csdn.net/shakebalabala/article/details/134394054

版权

跑pytorch模型报错：

RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation: [torch.cuda.FloatTensor [720, 64, 36, 36]], which is output 0 of TanhBackward0, is at version 1; expected version 0 instead. Hint: enable anomaly detection to find the operation that failed to compute its gradient, with torch.autograd.set_detect_anomaly(True).

根据[720, 64, 36, 36]，定位模型报错部分：

上网搜了一下，由于使用了分布式训练，在模型中使用Y +=X的操作时容易在进行此操作前，数据被修改。因此只要将Y=Y+X改为Y=Y.clone()+X即可