While training a network I ran into the following error:
RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation: [torch.cuda.FloatTensor [1, 256, 1, 1]] is at version 2; expected version 1 instead. Hint: enable anomaly detection to find the operation that failed to compute its gradient, with torch.autograd.set_detect_anomaly(True).
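To make the message less abstract, here is a minimal sketch that triggers the same class of error on the CPU and follows the hint about anomaly detection. The function name `trigger_inplace_error` and the toy tensors are my own illustration, not the network from my training run.

```python
import torch

def trigger_inplace_error(detect_anomaly=False):
    """Return the RuntimeError text raised when a saved tensor is edited in place."""
    # set_detect_anomaly can be used as a context manager; when enabled, autograd
    # also prints the traceback of the forward operation whose backward failed.
    with torch.autograd.set_detect_anomaly(detect_anomaly):
        x = torch.ones(3, requires_grad=True)
        y = x.exp()      # exp keeps its output for the backward pass
        y.add_(1)        # in-place edit bumps the version counter of y
        try:
            y.sum().backward()
        except RuntimeError as err:
            return str(err)
    return None

message = trigger_inplace_error(detect_anomaly=True)
print(message)
```

The tensor shape and version numbers in the printed message differ from mine, but the wording about an inplace operation is the same.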
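There are all sorts of posts sharing fixes for this: some say to change ReLU from inplace=True to inplace=False, and others tie it to distributed training. This blog post also mentions that downgrading torch helps.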
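As an illustration of the ReLU suggestion, the sketch below compares `nn.ReLU(inplace=True)` with `nn.ReLU(inplace=False)` when the activation is applied to a tensor that an earlier operation still needs for its gradient. The helper `backward_ok` and the use of `torch.sigmoid` as the earlier operation are assumptions chosen to keep the example small; whether this applies to a given model depends on its layers.

```python
import torch
from torch import nn

def backward_ok(inplace):
    """Return True if the backward pass succeeds for the given ReLU setting."""
    x = torch.randn(4, requires_grad=True)
    h = torch.sigmoid(x)                  # sigmoid saves its output for backward
    out = nn.ReLU(inplace=inplace)(h)     # inplace=True overwrites h itself
    try:
        out.sum().backward()
    except RuntimeError:
        return False
    return True

print("inplace=True :", backward_ok(True))
print("inplace=False:", backward_ok(False))
```

With inplace=False the activation writes to a new tensor, so the value saved by the earlier operation stays untouched.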
In my case the problem was also a torch version mismatch.
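One documented way a version change can surface this error, offered here as an assumption about this class of problem and not as a diagnosis of my own code: since PyTorch 1.5 the in-place weight update done by `optimizer.step()` is visible to autograd's version check, so calling `backward()` on a graph built before the step fails instead of silently using the updated weights. The sketch below assumes PyTorch 1.5 or newer; `second_backward_ok` and the tiny model are hypothetical.

```python
import torch
from torch import nn

def second_backward_ok(recompute_forward):
    """Return True if a backward pass after optimizer.step() succeeds."""
    torch.manual_seed(0)
    net = nn.Sequential(nn.Linear(4, 4), nn.Linear(4, 1))
    opt = torch.optim.SGD(net.parameters(), lr=0.1)
    x = torch.randn(2, 4)

    loss = net(x).sum()
    loss.backward(retain_graph=True)
    opt.step()                    # updates the weights in place
    if recompute_forward:
        opt.zero_grad()
        loss = net(x).sum()       # fresh graph built from the updated weights
    try:
        loss.backward()           # assumption: PyTorch >= 1.5 checks the weight versions
    except RuntimeError:
        return False
    return True

print(torch.__version__)
print("reuse old graph :", second_backward_ok(False))
print("recompute first :", second_backward_ok(True))
```

If the code matches this pattern, rerunning the forward pass after the step (or doing all backward calls before the step) is an alternative to downgrading.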
The underlying reason why downgrading torch helps is explained in this link:
an expert's blog post
If this helped, feel free to leave a comment.