CrossEntropyLoss, BCEWithLogitsLoss, and details of BCEWithLogitsLoss (giving different samples different weights; giving samples of different classes different weights)

ystsaan

Category: pytorch

Article link: https://blog.csdn.net/weixin_42388228/article/details/109090431

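Summary (auto-generated by CSDN): This article looks closely at how the weight parameter differs between torch.nn.CrossEntropyLoss and BCEWithLogitsLoss, noting that the weight of CrossEntropyLoss holds per-class weights while the weight of BCELoss holds per-sample weights. It also explains why CrossEntropyLoss should not be used when the last layer of the network is a softmax, and discusses handling class imbalance by weighting samples and classes differently, as in GHM Loss and Focal Loss.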

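The weight parameter of torch.nn.BCELoss and torch.nn.BCEWithLogitsLoss does not mean the same thing as the weight parameter of torch.nn.CrossEntropyLoss. In torch.nn.CrossEntropyLoss, weight is a 1-D tensor with one weight per class. In torch.nn.BCELoss and torch.nn.BCEWithLogitsLoss, weight rescales the element-wise loss and is documented as a weight for each batch element, that is, a per-sample weight; per-class weighting of positive examples in torch.nn.BCEWithLogitsLoss goes through the separate pos_weight parameter. torch.nn.BCEWithLogitsLoss applies a sigmoid to its input and then averages -y*log(x) - (1-y)*log(1-x), where x = sigmoid(input). torch.nn.CrossEntropyLoss works similarly: it applies a softmax to its input and then averages -y*log(x), where x = softmax(input) and y is the one-hot label.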
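The weighting rules above can be checked numerically. The following is a minimal sketch rather than a reference implementation: the batch size `N`, the class count `C`, and the weight values `class_w`, `sample_w` and `pos_w` are all made up for illustration.

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)
N, C = 4, 3  # hypothetical batch size and number of classes

# CrossEntropyLoss: `weight` has one entry per class (shape [C]).
logits = torch.randn(N, C)
target = torch.tensor([0, 2, 1, 2])        # class indices
class_w = torch.tensor([1.0, 2.0, 0.5])    # illustrative class weights
ce = torch.nn.CrossEntropyLoss(weight=class_w)(logits, target)

# Manual version: weighted mean of -log softmax at the target class.
logp = F.log_softmax(logits, dim=1)
per_sample = -logp[torch.arange(N), target]
w = class_w[target]                        # weight of each sample's class
ce_manual = (w * per_sample).sum() / w.sum()

# BCEWithLogitsLoss: `weight` is broadcast over the element-wise loss.
# A tensor of shape [N, 1] therefore acts as one weight per sample.
y = torch.randint(0, 2, (N, C)).float()    # multi-label 0/1 targets
sample_w = torch.tensor([[1.0], [0.5], [2.0], [1.0]])
bce = torch.nn.BCEWithLogitsLoss(weight=sample_w)(logits, y)

x = torch.sigmoid(logits)
elementwise = -(y * torch.log(x) + (1 - y) * torch.log(1 - x))
bce_manual = (sample_w * elementwise).mean()

# pos_weight: one weight per class, applied to the positive term only.
pos_w = torch.tensor([3.0, 1.0, 1.0])
bce_pos = torch.nn.BCEWithLogitsLoss(pos_weight=pos_w)(logits, y)
bce_pos_manual = -(pos_w * y * torch.log(x)
                   + (1 - y) * torch.log(1 - x)).mean()

print(ce.item(), ce_manual.item())
print(bce.item(), bce_manual.item())
print(bce_pos.item(), bce_pos_manual.item())
```

Each printed pair should agree up to floating-point error. Note that with the default mean reduction, the weighted CrossEntropyLoss divides by the sum of the weights of the targets, not by the batch size.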

The rest of this post focuses on torch.nn.CrossEntropyLoss.
The documentation at https://pytorch.org/docs/stable/generated/torch.nn.CrossEntropyLoss.html#torch.nn.CrossEntropyLoss says that torch.nn.CrossEntropyLoss "combines nn.LogSoftmax() and nn.NLLLoss()", that is, torch.nn.CrossEntropyLoss is the two operations nn.LogSoftmax() and nn.NLLLoss() applied in sequence.
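As a quick check of that statement, the sketch below compares torch.nn.CrossEntropyLoss with an explicit LogSoftmax followed by NLLLoss. The logits reuse the values from this post's own example further down; the labels in `target` are hypothetical, since the post does not show them.

```python
import torch

# Logits taken from the example in this post; the labels are hypothetical.
logits = torch.tensor([[-0.3209,  0.0796],
                       [-1.1043,  0.3333],
                       [ 1.1423,  1.5803]])
target = torch.tensor([0, 1, 1])

# One call on raw logits.
ce = torch.nn.CrossEntropyLoss()(logits, target)

# The same thing in two explicit steps.
log_probs = torch.nn.LogSoftmax(dim=1)(logits)
nll = torch.nn.NLLLoss()(log_probs, target)

# Per-sample closed form: -x[class] + log(sum_j exp(x[j])), then the batch mean.
picked = logits[torch.arange(logits.shape[0]), target]
manual = (-picked + torch.logsumexp(logits, dim=1)).mean()

print(ce.item(), nll.item(), manual.item())
```

All three values should match up to floating-point error.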
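[Image in the original post: the CrossEntropyLoss formula from the linked documentation, presumably loss(x, class) = -log(exp(x[class]) / sum_j exp(x[j])) = -x[class] + log(sum_j exp(x[j]))]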
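From the formula above, torch.nn.CrossEntropyLoss computes the loss of one sample by applying softmax to the input and then taking the log. Many hand-written networks, however, already end with a softmax layer. Feeding the output of such a network into torch.nn.CrossEntropyLoss applies softmax twice to the pre-softmax logits before the log is taken. So when the last layer of the network is a softmax layer, torch.nn.CrossEntropyLoss cannot be used as-is, and the loss computation has to be modified.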

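import torch

loss_logsoftmax = torch.nn.CrossEntropyLoss()  # applies LogSoftmax and NLLLoss internally
loss = torch.nn.NLLLoss()                      # expects log-probabilities as input
input = torch.tensor([[-0.3209,  0.0796],
                      [-1.1043,  0.3333],
                      [ 1.1423,  1.5803]])     # 3 samples, 2 classes (raw logits)
# The source text is cut off in the middle of the next statement:
# input_softmax=torch.nn.Softmax(dim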
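The snippet in the post is cut off at this point. The following is a sketch of the point it was building towards, not a reconstruction of the author's code: the labels in `target` and the choice of `dim=1` are assumptions, and `probs` stands in for the output of a network whose last layer is a softmax.

```python
import torch

logits = torch.tensor([[-0.3209,  0.0796],
                       [-1.1043,  0.3333],
                       [ 1.1423,  1.5803]])
target = torch.tensor([0, 1, 1])  # hypothetical labels, not from the post

# What a network ending in a softmax layer would output (assumes dim=1).
probs = torch.nn.Softmax(dim=1)(logits)

# Reference value: CrossEntropyLoss on the raw logits.
correct = torch.nn.CrossEntropyLoss()(logits, target)

# Pitfall: CrossEntropyLoss on probabilities applies softmax a second time.
double = torch.nn.CrossEntropyLoss()(probs, target)

# Fix when the softmax layer has to stay: take the log, then use NLLLoss.
fixed = torch.nn.NLLLoss()(torch.log(probs), target)

print(correct.item(), double.item(), fixed.item())
```

`correct` and `fixed` should agree, while `double` differs. Because the entries of `probs` lie in [0, 1], the doubly-softmaxed loss for two classes cannot fall below log(1 + e^-1), roughly 0.313, however confident the network is. The other common fix is to remove the softmax layer and pass raw logits to torch.nn.CrossEntropyLoss, which also avoids the numerical risk of taking the log of a very small probability.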
