交叉熵损失函数

bleedingfight

已于 2022-07-30 23:06:35 修改

阅读量789

点赞数

分类专栏：计算机视觉文章标签：深度学习 tensorflow python

于 2021-02-24 01:12:49 首次发布

本文链接：https://blog.csdn.net/bleedingfight/article/details/114004564

版权

计算机视觉专栏收录该内容

14 篇文章 0 订阅

订阅专栏

交叉熵损失函数

多分类的交叉熵损失函数：
$\sum_{i=1}^cy_i\cdot\log(\hat{y})\quad y_i = softmax(x_i)=\frac{e^{x_i}}{\sum_{i=1}^ce^{x_i}}$

TensorFlow计算多分类交叉熵损失

Tensorflow中的交叉熵损失函数：tf.nn.softmax_cross_entropy_with_logits在PyTorch中损失函数为：torch.nn.CrossEntropyLoss
对于如下数据：
$\quad \text{labels表示真实输出}y\\ logits = [0.0, 5.0, 1.0]\quad \text{logits表示网络的输出}\hat{y}$
可以看到真实输出里面的元素大于0,所以需要用softmax处理为概率形式：计算:
$so f t ma x (l o g i t s) = [0.00657326, 0.9755587, 0.01786798]$ ，那么交叉熵损失函数为： $tf.reduce\_sum(labels*tf.math.log(tf.nn.softmax(logits)))=0.82474494$

使用TensorFlow交叉熵损失函数计算：

labels=[0.0,0.8,0.2]
logits=[0.0,5.0,1.0]
-tf.nn.softmax_cross_entropy_with_logits(labels,logits) # 0.82474494

在实际的分类任务中labels通常是one-hot编码的结果：例如[0,1,0](表示输出类别为1)，那么对于如下的数据：

labels = [[0,1,0]]
logits = [0,5.,1]

使用Tensorflow计算：

import tensorflow as tf
device = tf.config.get_visible_devices()
tf.config.experimental.get_memory_growth(device[1])

labels = tf.constant([[0,1,0]],dtype=tf.float32)
logits = tf.constant([[0,5.,1]])
origin_output = -tf.reduce_sum(labels*tf.math.log(tf.nn.softmax(logits))) 
print(origin_output.numpy(),tf.nn.softmax_cross_entropy_with_logits(labels,logits).numpy(),tf.nn.sparse_softmax_cross_entropy_with_logits(tf.argmax(labels,axis=1),logits).numpy())

结果如下tf.nn.sparse_softmax_cross_entropy_with_logits计算只是将one-hot转换为了索引表示的实际class值)：

0.024744948 [0.02474492] [0.02474492]

PyTorch计算多分类交叉熵损失

PyTorch使用了另一种表示方法：

$\text{loss}(x, class) = -\log\left(\frac{\exp(x[class])}{\sum_j \exp(x[j])}\right)\\ = -x[class] + \log\left(\sum_j \exp(x[j])\right)$
对于上面的输出 $\hat{y}=[0.0,5.0,1.0]$ 表示1出现的概率最大，输出为1。即 $x [c l a ss] = x [1] = l ab e l s [1] = 0.8$ 计算结果为： $-0.8+log\sum_{i=1}^3e^{x[i]}=0.6922$
原始计算：

labels = torch.tensor([[0.0,0.8,0.2]])
logits = torch.Tensor([[0.0,5.0,1.0]])
loss_value = -labels[0][torch.argmax(logits)]+torch.log(torch.sum(torch.exp(labels),axis=1))
print(loss_value) #

直接计算：

loss = torch.nn.CrossEntropyLoss()
logits = torch.Tensor([[0.0,5.0,1.0]])
labels = torch.Tensor([[0.0,0.8,0.2]])
loss_value = loss(labels,y)
print(loss_value) # 0.6922

对于上述例子：

import torch
one_hot_labels = torch.Tensor([[0,1,0]])
labels = torch.argmax(one_hot_labels,axis=1)
# labels = torch.tensor([1])
logits = torch.Tensor([[0,5,1]])
loss1 = torch.nn.CrossEntropyLoss()(logits,labels)
loss2 = torch.nn.NLLLoss()(torch.nn.LogSoftmax(dim=-1)(logits), labels)
print(loss1,loss2)

完整验证：

import torch
from torch import nn 
def logsoftmax(data):
    return torch.log(torch.softmax(data,dim=1))
def nlloss(data,labels):
    res = 0
    for index,label_num in enumerate(labels):
        res+=data[index][label_num]
    return -res/len(data)
        
def cross_entropy(data,label):
    loss_value = nlloss(logsoftmax(data),labels)
    return loss_value
class_num = 5
data = torch.randn(3,class_num,dtype=torch.float)
labels = torch.randint(0,class_num,size=(3,))

t_logsoftmax = nn.LogSoftmax(dim=-1)(data)
m_logsoftmax = logsoftmax(data)
print(t_logsoftmax,m_logsoftmax)
t_nlloss = nn.NLLLoss()(t_logsoftmax,labels)
m_nlloss = nlloss(m_logsoftmax,labels)

print(t_nlloss)
print(m_nlloss)

loss = nn.CrossEntropyLoss()
t_loss = loss(data,labels)
m_loss = cross_entropy(data,labels)
print("交叉熵损失：{:.4f}(PyTorch) {:.4f}(My)".format(t_loss,m_loss))

bleedingfight

关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
交叉熵损失函数

交叉熵损失函数多分类的交叉熵损失函数：CrossEntropy(x)=∑i=1cyi⋅log⁡(y^)yi=softmax(xi)=exi∑i=1cexiCrossEntropy(x) = \sum_{i=1}^cy_i\cdot\log(\hat{y})\quad y_i = softmax(x_i)=\frac{e^{x_i}}{\sum_{i=1}^ce^{x_i}}CrossEntropy(x)=i=1∑cyi⋅log(y^)yi=softmax(xi)=∑i=1cexiexi
复制链接

扫一扫

专栏目录