深入浅出理解CrossEntropy

import torch
import numpy as np
import torch.nn.functional as F
x = np.array([[1, 2,3,4,5],
             [1, 2,3,4,5],
             [1, 2,3,4,5]]).astype(np.float32)
y = np.array([1, 1, 0])
x = torch.from_numpy(x)
y = torch.from_numpy(y).long()  #标签为long型
soft_out = F.softmax(x,dim=1)
log_soft_out1 = torch.log(soft_out)
print("log_soft_out1:{}".format(log_soft_out1))
log_soft_out2 = F.log_softmax(soft_out)
print("log_soft_out2:{}".format(log_soft_out1))
loss = F.nll_loss(log_soft_out1, y)
print(soft_out)
print(log_soft_out1)
print(loss)
loss = F.cross_entropy(x, y)
print(loss)

结果:

log_soft_out1:tensor([[-4.4519, -3.4519, -2.4519, -1.4519, -0.4519],
        [-4.4519, -3.4519, -2.4519, -1.4519, -0.4519],
        [-4.4519, -3.4519, -2.4519, -1.4519, -0.4519]])
log_soft_out2:tensor([[-4.4519, -3.4519, -2.4519, -1.4519, -0.4519],
        [-4.4519, -3.4519, -2.4519, -1.4519, -0.4519],
        [-4.4519, -3.4519, -2.4519, -1.4519, -0.4519]])
tensor([[0.0117, 0.0317, 0.0861, 0.2341, 0.6364],
        [0.0117, 0.0317, 0.0861, 0.2341, 0.6364],
        [0.0117, 0.0317, 0.0861, 0.2341, 0.6364]])
tensor([[-4.4519, -3.4519, -2.4519, -1.4519, -0.4519],
        [-4.4519, -3.4519, -2.4519, -1.4519, -0.4519],
        [-4.4519, -3.4519, -2.4519, -1.4519, -0.4519]])
tensor(3.7852)
tensor(3.7852)

结果分析:

F.softmax(x,dim=1):一行和为1 sum([0.0117, 0.0317, 0.0861, 0.2341, 0.6364])=1
softmax函数公式
torch.log(soft_out):对softmax的结果进行取对数
F.nll_loss(log_soft_out, y):对取对数的结果,根据y的值,(y值是索引),找到对应的值,黄色部分,各自取相反数再相加,求平均
(3.4519+3.4519+4.4519)/3 = 3.7852
所以:
cross_entropy函数:softmax->log->nll_loss
  • 1
    点赞
  • 1
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值