Fixing Several Errors Encountered During Gradient Backpropagation in PyTorch Model Training

Latest recommended article published on 2024-06-19 02:03:38

森尼嫩豆腐


Category: Practical Tools. Tags: python, debug, pytorch

Article link: https://blog.csdn.net/lavinia_chen007/article/details/118573825


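This post covers two errors hit during gradient backpropagation when training a model with PyTorch: a RuntimeError and an IndexError. For the RuntimeError, inspection showed that the loss had requires_grad set to False, and changing the code resolved it. The IndexError was caused by indexing into a 0-dim tensor and went away once the code was corrected. Finally, the post suggests that some step inside the model may be detaching outputs, so the model should be checked to make sure gradients are actually computed.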
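The RuntimeError described in the summary can be reproduced in a few lines. The sketch below is not the author's code: the `torch.nn.Linear` model, the random inputs, and the explicit `.detach()` call are assumptions, chosen only to mimic a model that cuts `outputs` out of the autograd graph somewhere in its forward pass.

```python
import torch

torch.manual_seed(0)

# Hypothetical stand-ins for the real model and data (not from the original post).
model = torch.nn.Linear(4, 3)
criterion = torch.nn.CrossEntropyLoss()
inputs = torch.randn(2, 4)
targets = torch.tensor([0, 2])

outputs = model(inputs)

# Simulate a step that detaches the outputs from the autograd graph.
broken_loss = criterion(outputs.detach(), targets)
print(broken_loss.requires_grad, broken_loss.grad_fn)

try:
    broken_loss.backward()
    backward_failed = False
except RuntimeError as err:
    backward_failed = True
    print("backward failed:", err)

# Without the detach, the loss stays attached to the graph and backward() works.
loss = criterion(outputs, targets)
loss.backward()
print(loss.requires_grad, model.weight.grad is not None)
```

Printing `loss.requires_grad` and `loss.grad_fn` right before `loss.backward()` is a quick way to tell whether the graph was cut. Forcing `requires_grad` to True on such a loss makes the error disappear, but no gradient reaches the model parameters, so tracking down the detaching step is the more reliable fix.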

Summary generated automatically by CSDN.

Contents

Related code
Error 1: RuntimeError: element 0 of tensors does not require grad and does not have a grad_fn
Error 2: IndexError: invalid index of a 0-dim tensor. Use `tensor.item()` in Python or `tensor.item<T>()` in C++ to convert a 0-dim tensor to a number
Updated solution

This post collects two error messages that came up while training a PyTorch model, together with how each one was fixed.
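The second error, the IndexError, is easy to trigger on its own. Here is a minimal sketch, assuming the failing code indexed a scalar loss with `[0]`; the tensor values are made up for illustration.

```python
import torch

# A reduction such as a mean loss returns a 0-dim (scalar) tensor.
loss = torch.tensor([0.5, 0.0]).mean()
print(loss.dim())

try:
    value = loss[0]  # indexing a scalar tensor is not allowed
    index_failed = False
except IndexError as err:
    index_failed = True
    print("indexing failed:", err)

# .item() converts the 0-dim tensor to a plain Python number.
value = loss.item()
print(type(value).__name__, value)
```

`.item()` only works on tensors with exactly one element, which is the case for a reduced loss.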

Related code

Definition of the loss

class CrossEntropyLoss2d(torch.nn.Module):

    def __init__(self, weight=None):
        super().__init__()

        self.loss = torch.nn.NLLLoss(weight)

    def forward(self, outputs
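The listing above is cut off in the middle of `forward`, so the original body is not available. For readers who want something runnable, here is a sketch of how a wrapper like this is often written, under the assumption that `forward` applies `log_softmax` over the channel dimension before `NLLLoss`. The class name `CrossEntropyLoss2dSketch`, the `targets` parameter, and the tensor shapes are hypothetical and may differ from the author's version.

```python
import torch
import torch.nn.functional as F


class CrossEntropyLoss2dSketch(torch.nn.Module):
    """Hypothetical completion; the original forward() is truncated."""

    def __init__(self, weight=None):
        super().__init__()
        self.loss = torch.nn.NLLLoss(weight)

    def forward(self, outputs, targets):
        # Assumption: outputs are raw scores of shape (N, C, H, W),
        # targets are class indices of shape (N, H, W).
        return self.loss(F.log_softmax(outputs, dim=1), targets)


criterion = CrossEntropyLoss2dSketch()
outputs = torch.randn(2, 3, 4, 4, requires_grad=True)
targets = torch.randint(0, 3, (2, 4, 4))

loss = criterion(outputs, targets)
print(loss.dim(), loss.requires_grad)

loss.backward()
print(outputs.grad.shape)
```

The loss returned here is a 0-dim tensor that is still attached to the graph, so `loss.backward()` works and `loss.item()` is the way to read its value.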
