PyTorch Study Notes: The Relationship Between CrossEntropyLoss, NLLLoss, Softmax, and LogSoftmax

xrn1997

Category: Python Study Notes. Tags: pytorch, python, deep learning

Article link: https://blog.csdn.net/xrn1997/article/details/118900979


import torch
import torch.nn as nn

data = torch.tensor([[-0.2733, 0.3222, 0.2605],
                     [1.5393, 1.1688, -0.0975],
                     [0.3943, 0.5172, -0.9425]])  # a 3x3 matrix: 3 samples, 3 classes
print(data)
'''
Output:
tensor([[-0.2733,  0.3222,  0.2605],
        [ 1.5393,  1.1688, -0.0975],
        [ 0.3943,  0.5172, -0.9425]])
'''
sm = nn.Softmax(dim=1)  # softmax over each row
print(sm(data))
'''
Output:
tensor([[0.2213, 0.4014, 0.3774],
        [0.5305, 0.3663, 0.1032],
        [0.4178, 0.4725, 0.1098]])
'''
print(torch.log(sm(data)))  # log(softmax)
'''
Output:
tensor([[-1.5084, -0.9129, -0.9746],
        [-0.6339, -1.0044, -2.2707],
        [-0.8728, -0.7498, -2.2095]])
'''
slm = nn.LogSoftmax(dim=1)
print(slm(data))  # LogSoftmax
'''
Output:
tensor([[-1.5084, -0.9129, -0.9746],
        [-0.6339, -1.0044, -2.2707],
        [-0.8728, -0.7498, -2.2095]])
'''
# Conclusion: nn.LogSoftmax(dim=1)(x) gives the same result as torch.log(nn.Softmax(dim=1)(x))

loss = nn.NLLLoss()
target1 = torch.tensor([0, 1, 2])  # an arbitrary target tensor of class indices
print(loss(data, target1))  # NLLLoss applied directly to the raw data
'''
Output:
tensor(0.0157)
NLLLoss only picks data[n, target1[n]], negates it, and averages:
(0.2733 + 0.9425 - 1.1688) / 3 ≈ 0.0157
'''
print(loss(slm(data), target1))  # NLLLoss applied to the LogSoftmax output
'''
Output:
tensor(1.5741)
(1.5084 + 1.0044 + 2.2095) / 3 ≈ 1.5741
'''
loss2 = nn.CrossEntropyLoss()
print(loss2(data, target1))  # CrossEntropyLoss applied to the raw data
'''
Output:
tensor(1.5741)
'''
# Conclusion: nn.CrossEntropyLoss()(input, target1) == nn.NLLLoss()(nn.LogSoftmax(dim=1)(input), target1)

target2 = torch.tensor([[1, 0, 0], [0, 1, 0], [0, 0, 1]])  # one-hot labels, equivalent to target1
custom_loss = -torch.sum(slm(data) * target2) / 3  # sum the target log-probabilities, negate, average over 3 samples
print(custom_loss)
'''
Output:
tensor(1.5741)
'''
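
The equivalences above can also be checked programmatically instead of by comparing printed tensors. The following is a minimal sketch using the functional API in torch.nn.functional; the variable names are illustrative. It additionally uses the identity log_softmax(x) = x - logsumexp(x), which is not in the code above.

```python
import torch
import torch.nn.functional as F

data = torch.tensor([[-0.2733, 0.3222, 0.2605],
                     [1.5393, 1.1688, -0.0975],
                     [0.3943, 0.5172, -0.9425]])
target = torch.tensor([0, 1, 2])

log_probs = F.log_softmax(data, dim=1)

# log(softmax(x)) and log_softmax(x) agree up to floating-point error
same_log_softmax = torch.allclose(
    log_probs, torch.log(F.softmax(data, dim=1)), atol=1e-6)

# log_softmax(x) = x - logsumexp(x), computed row by row
manual_log_probs = data - torch.logsumexp(data, dim=1, keepdim=True)

# CrossEntropyLoss on raw scores vs. NLLLoss on log-probabilities
ce = F.cross_entropy(data, target)
nll = F.nll_loss(log_probs, target)

# Pick the log-probability of the target class in each row, negate, and average
manual = -log_probs[torch.arange(data.shape[0]), target].mean()

print(same_log_softmax, ce, nll, manual)
```

All three loss values should agree, which matches the conclusions drawn in the code comments.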

The formula for nn.NLLLoss() is given below. With reduction='none', the loss is
$ℓ(x,y)=L=\{l_1,…,l_N\}^⊤, \quad l_n=−w_{y_n}x_{n,y_n}, \quad w_c=\text{weight}[c]⋅1\{c\neq \text{ignore\_index}\}$
where N is the batch size. Otherwise,
$ℓ(x,y)=\begin{cases} \sum_{n=1}^{N} \frac{1}{\sum_{m=1}^{N}w_{y_m}} l_n, & \text{if reduction}=\text{'mean'}; \\ \sum_{n=1}^{N} l_n, & \text{if reduction}=\text{'sum'}. \end{cases}$
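
To make the formula concrete, here is a rough sketch that follows it term by term and compares the result with the built-in loss. The helper manual_nll_loss and the weights [1.0, 2.0, 3.0] are hypothetical and chosen only for illustration; they do not come from the original code. The default ignore_index=-100 mirrors the PyTorch default.

```python
import torch
import torch.nn.functional as F


def manual_nll_loss(x, y, weight=None, ignore_index=-100, reduction="mean"):
    """Hypothetical helper that mirrors the NLLLoss formula."""
    n_samples, n_classes = x.shape
    if weight is None:
        weight = torch.ones(n_classes, dtype=x.dtype)
    valid = y != ignore_index                             # 1{c != ignore_index}
    safe_y = torch.where(valid, y, torch.zeros_like(y))   # placeholder class for ignored rows
    w = weight[safe_y] * valid.to(x.dtype)                # w_{y_n}
    picked = x[torch.arange(n_samples), safe_y]           # x_{n, y_n}
    l = -w * picked                                       # l_n
    if reduction == "mean":
        return l.sum() / w.sum()
    if reduction == "sum":
        return l.sum()
    return l                                              # reduction == "none"


data = torch.tensor([[-0.2733, 0.3222, 0.2605],
                     [1.5393, 1.1688, -0.0975],
                     [0.3943, 0.5172, -0.9425]])
log_probs = F.log_softmax(data, dim=1)
target = torch.tensor([0, 1, 2])
weight = torch.tensor([1.0, 2.0, 3.0])  # illustrative weights (assumption)

manual = manual_nll_loss(log_probs, target, weight=weight)
builtin = F.nll_loss(log_probs, target, weight=weight)
print(manual, builtin)
```

Note that with non-uniform weights the 'mean' reduction divides by the sum of the weights of the targets, not by the batch size.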
1. By default, every class weight is 1. In the code above, passing weight explicitly would mean weight=torch.Tensor([1, 1, 1]); that is, nn.NLLLoss(weight=torch.Tensor([1, 1, 1])) is equivalent to nn.NLLLoss().
2. By default, the reduction of nn.NLLLoss() is 'mean'.
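
Both notes can be checked with a short sketch that reuses the same data and targets (variable names are illustrative):

```python
import torch
import torch.nn as nn

data = torch.tensor([[-0.2733, 0.3222, 0.2605],
                     [1.5393, 1.1688, -0.0975],
                     [0.3943, 0.5172, -0.9425]])
target = torch.tensor([0, 1, 2])
log_probs = nn.LogSoftmax(dim=1)(data)

# Note 1: unit weights give the same result as no weight argument
default_loss = nn.NLLLoss()(log_probs, target)
ones_loss = nn.NLLLoss(weight=torch.ones(3))(log_probs, target)

# Note 2: the default reduction is 'mean'
mean_loss = nn.NLLLoss(reduction='mean')(log_probs, target)

# The other reductions, for comparison
sum_loss = nn.NLLLoss(reduction='sum')(log_probs, target)
per_sample = nn.NLLLoss(reduction='none')(log_probs, target)

print(default_loss, ones_loss, mean_loss, sum_loss, per_sample)
```

With unit weights, the 'sum' result equals the 'mean' result multiplied by the number of samples (3 here), and 'none' returns one loss value per sample.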
