For more background, you can also refer to the CSDN post "Softmax, Log_Softmax, NLLLoss and CrossEntropyLoss in PyTorch: relationships and differences explained" (NeilPy's blog).
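As a quick illustration of that relationship (a minimal sketch with made-up tensor values), nn.CrossEntropyLoss produces the same value as applying nn.LogSoftmax followed by nn.NLLLoss:

import torch
import torch.nn as nn

logits = torch.tensor([[0.5, 1.2, -0.3]])   # hypothetical scores for one sample, 3 classes
target = torch.tensor([1])                  # hypothetical target class

ce = nn.CrossEntropyLoss()(logits, target)
# equivalent: log-softmax of the scores, then negative log-likelihood
nll = nn.NLLLoss()(nn.LogSoftmax(dim=1)(logits), target)
print(ce.item(), nll.item())                # the two values are identical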
Based on the formula above, we carry out the following calculation:
import torch
import torch.nn as nn

loss = nn.CrossEntropyLoss()                           # default reduction='mean'
input = torch.randn(3, 5, requires_grad=True)          # 3 samples, 5 classes
print(input)
target = torch.empty(3, dtype=torch.long).random_(5)   # random class index in [0, 5) for each sample
print(target)
output = loss(input, target)
print(output)
output.backward()
Program output:
input tensor([[ 0.8613, 0.2848, -0.9878, 1.6137, 1.6703],
[-0.5740, 0.6567, -0.7853, -1.5065, 1.3024],
[-1.2544, -0.7814, 0.0204, -0.7491, 0.3055]], requires_grad=True)
tensor([0, 2, 3])
loss:tensor(2.1812, grad_fn=<NllLossBackward>)
Manual verification:
With the default reduction='mean', CrossEntropyLoss averages the per-sample losses, and each sample's loss is

loss(x, class) = -x[class] + log( Σ_j exp(x[j]) )
Sample 1 (target class 0):

import math

listnum = [0.8613, 0.2848, -0.9878, 1.6137, 1.6703]
total = 0
for num in listnum:
    total = total + math.exp(num)
loss1 = -listnum[0] + math.log(total)
print(loss1)   # 1.8061534287700232
Sample 2 (target class 2):

listnum = [-0.5740, 0.6567, -0.7853, -1.5065, 1.3024]
total = 0
for num in listnum:
    total = total + math.exp(num)
loss2 = -listnum[2] + math.log(total)
print(loss2)   # 2.709178782095951
Sample 3 (target class 3):

listnum = [-1.2544, -0.7814, 0.0204, -0.7491, 0.3055]
total = 0
for num in listnum:
    total = total + math.exp(num)
loss3 = -listnum[3] + math.log(total)
print(loss3)   # 2.028286903303233
loss = (loss1 + loss2 + loss3) / 3 = 2.181206371389736, which matches the result above.
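As a cross-check, the same three per-sample losses and their mean can be computed in one shot with torch.logsumexp, reusing the input and target values printed above (a minimal vectorized sketch):

import torch

x = torch.tensor([[ 0.8613,  0.2848, -0.9878,  1.6137,  1.6703],
                  [-0.5740,  0.6567, -0.7853, -1.5065,  1.3024],
                  [-1.2544, -0.7814,  0.0204, -0.7491,  0.3055]])
t = torch.tensor([0, 2, 3])

# per-sample loss: -x[i, t[i]] + log(sum_j exp(x[i, j]))
per_sample = -x[torch.arange(3), t] + torch.logsumexp(x, dim=1)
print(per_sample)         # ≈ tensor([1.8062, 2.7092, 2.0283])
print(per_sample.mean())  # ≈ tensor(2.1812), same as nn.CrossEntropyLoss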