Loss Functions: NLLLoss and CrossEntropyLoss

旺旺棒棒冰

Article link: https://blog.csdn.net/ltochange/article/details/117919900

NLLLoss

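NLLLoss is the negative log-likelihood loss, used for multi-class classification. Its input is log-probabilities.

Consider a batch $D(x, y)$ of $N$ samples. $x$ is the network output after normalization and taking the logarithm, and $y$ holds the class labels, where each sample belongs to one of $C$ classes.

$l_{n}$ is the loss of the $n$-th sample, with $0 \leq y_{n} \leq C-1$: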

$l_{n}=-w_{y_{n}} x_{n, y_{n}}$

The weight parameter is used to handle imbalance between the numbers of samples in different classes:

$w_{c}=\text{weight}[c] \cdot \mathbb{1}\{c \neq \text{ignore\_index}\}$
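The two formulas above can be sketched in plain Python. This is only an illustration of the math, not PyTorch's implementation: the helper name `nll_per_sample` and the sample values are made up for this example, and a missing weight is assumed to mean a weight of 1 for every class.

```python
import math

def nll_per_sample(log_probs, targets, weight=None, ignore_index=-100):
    """Per-sample losses l_n = -w_{y_n} * x_{n, y_n} (hypothetical helper)."""
    num_classes = len(log_probs[0])
    if weight is None:
        weight = [1.0] * num_classes  # assumption: unweighted classes count as 1
    losses = []
    for x_n, y_n in zip(log_probs, targets):
        if y_n == ignore_index:
            # The indicator 1{c != ignore_index} is 0, so the loss is 0.
            losses.append(0.0)
        else:
            losses.append(-weight[y_n] * x_n[y_n])
    return losses

# Two samples, three classes; each row holds log-probabilities.
log_probs = [
    [math.log(0.7), math.log(0.2), math.log(0.1)],
    [math.log(0.1), math.log(0.3), math.log(0.6)],
]
targets = [0, 2]
weight = [1.0, 1.0, 2.0]  # class 2 counts double

losses = nll_per_sample(log_probs, targets, weight)
print(losses)
```

The second sample has target class 2, so its loss is scaled by 2, while a target equal to ignore_index would yield a loss of 0.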

class NLLLoss(_WeightedLoss):
    __constants__ = ['ignore_index', 'reduction']
    ignore_index: int
    def __init__(self, weight: Optional[Tensor] = None, size_average=None, ignore_index: int = -100,
                 reduce=None, reduction: str = 'mean') -> None:
        super(NLLLoss, self).__init__(weight, size_average, reduce, reduction)
        self.ignore_index = ignore_index
    def forward(self, input: Tensor, target: Tensor) -> Tensor:
        assert self.weight is None or isinstance(self.weight, Tensor)
        return F.nll_loss(input, target, weight=self.weight, ignore_index=self.ignore_index, reduction=self.reduction)

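In PyTorch this loss is implemented by the torch.nn.NLLLoss class; the function F.nll_loss can also be called directly. The size_average and reduce arguments are deprecated. reduction takes one of three values, mean, sum, or none, each giving a different return value $\ell(x, y)$. The default is mean, which corresponds to the usual way of computing the overall loss.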

$L=\left\{l_{1}, \ldots, l_{N}\right\}$

$\ell(x, y)=\left\{\begin{array}{ll} L, & \text{if reduction} = \text{'none'} \\ \frac{\sum_{n=1}^{N} l_{n}}{\sum_{n=1}^{N} w_{y_{n}}}, & \text{if reduction} = \text{'mean'} \\ \sum_{n=1}^{N} l_{n}, & \text{if reduction} = \text{'sum'}\end{array}\right.$
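The three reduction modes can be compared with a small sketch. The logits, targets and class weights below are arbitrary values chosen for illustration. The point to notice is that with class weights, mean divides by the summed weights of the target classes rather than by $N$.

```python
import torch
import torch.nn.functional as F

# Arbitrary example: 3 samples, 3 classes.
log_probs = F.log_softmax(torch.tensor([[0.8, 0.9, 0.3],
                                        [0.2, 0.1, 1.5],
                                        [1.0, 0.0, 0.5]]), dim=1)
target = torch.tensor([1, 2, 0])
weight = torch.tensor([1.0, 2.0, 0.5])

per_sample = F.nll_loss(log_probs, target, weight=weight, reduction="none")
loss_sum = F.nll_loss(log_probs, target, weight=weight, reduction="sum")
loss_mean = F.nll_loss(log_probs, target, weight=weight, reduction="mean")

# 'mean' is the sum of l_n divided by the sum of w_{y_n}.
manual_mean = per_sample.sum() / weight[target].sum()

print(per_sample)                           # one loss per sample
print(loss_sum.item(), per_sample.sum().item())
print(loss_mean.item(), manual_mean.item())
```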

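The ignore_index parameter names a target value to be ignored: samples with that target do not contribute to the loss. The default is -100. For example, the labels at padding positions can be set to ignore_index.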
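A minimal sketch of the padding case, assuming the last two positions of a batch are padding. The name `PAD` and the logits are made up for this example.

```python
import torch
import torch.nn.functional as F

PAD = -100  # same value as the default ignore_index
logits = torch.tensor([[0.8, 0.9, 0.3],
                       [0.2, 0.1, 1.5],
                       [1.0, 0.0, 0.5],
                       [0.3, 0.3, 0.3]])
log_probs = F.log_softmax(logits, dim=1)
target = torch.tensor([1, 2, PAD, PAD])  # last two positions are padding

per_sample = F.nll_loss(log_probs, target, ignore_index=PAD, reduction="none")
loss_mean = F.nll_loss(log_probs, target, ignore_index=PAD, reduction="mean")

# Ignored positions contribute 0, and 'mean' averages over the real targets only.
loss_real_only = F.nll_loss(log_probs[:2], target[:2], reduction="mean")

print(per_sample)
print(loss_mean.item(), loss_real_only.item())
```

The mean over the full batch matches the mean over the two real samples, so padding does not dilute the loss.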

LogSoftmax

In PyTorch, torch.nn.LogSoftmax normalizes the network output and takes its logarithm:

$\operatorname{LogSoftmax}\left(x_{i}\right)=\log \left(\frac{\exp \left(x_{i}\right)}{\sum_{j} \exp \left(x_{j}\right)}\right)$
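The formula can be written out in plain Python as a sketch. The helper name `log_softmax` is hypothetical, and subtracting the maximum is an addition of this sketch that is not part of the formula above: it leaves the result unchanged but avoids overflow in exp for large inputs.

```python
import math

def log_softmax(xs):
    """log(exp(x_i) / sum_j exp(x_j)) for a list of floats (hypothetical helper)."""
    m = max(xs)  # shifting by the max does not change the result
    log_sum_exp = m + math.log(sum(math.exp(x - m) for x in xs))
    return [x - log_sum_exp for x in xs]

out = log_softmax([0.8, 0.9, 0.3])
print([round(v, 4) for v in out])
# Exponentiating the outputs gives probabilities that sum to 1.
print(sum(math.exp(v) for v in out))
```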

CrossEntropyLoss

CrossEntropyLoss is the cross-entropy loss, used for multi-class classification. Its input is the unnormalized network output.

$\text{CrossEntropyLoss}(x, y)=\text{NLLLoss}(\text{LogSoftmax}(x), y)$

Consider a batch $D(x, y)$ of $N$ samples. $x$ is the unnormalized network output, and $y$ holds the class labels, where each sample belongs to one of $C$ classes.

$l_{n}$ is the loss of the $n$-th sample, with $0 \leq y_{n} \leq C-1$:

$l_{n} =-w_{y_{n}}(\log \frac{\exp(x_{n, y_{n}})}{\sum_{j=0}^{C-1} \exp (x_{n,j})})$
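The per-sample formula can be checked against F.cross_entropy with reduction set to none. This is a sketch with arbitrary logits and class weights; it rewrites the log of the softmax fraction as $x_{n, y_{n}} - \log \sum_{j} \exp(x_{n, j})$ and computes the second term with torch.logsumexp.

```python
import torch
import torch.nn.functional as F

# Arbitrary example: 2 samples, 3 classes.
logits = torch.tensor([[0.8, 0.9, 0.3],
                       [0.2, 0.1, 1.5]])
target = torch.tensor([1, 2])
weight = torch.tensor([1.0, 2.0, 0.5])

# x_{n, y_n}: the logit of the true class for each sample
true_logit = logits.gather(1, target.unsqueeze(1)).squeeze(1)
# log sum_j exp(x_{n, j})
log_norm = torch.logsumexp(logits, dim=1)
# l_n = -w_{y_n} * (x_{n, y_n} - log sum_j exp(x_{n, j}))
manual = -weight[target] * (true_logit - log_norm)

builtin = F.cross_entropy(logits, target, weight=weight, reduction="none")
print(manual)
print(builtin)
```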

class CrossEntropyLoss(_WeightedLoss):
    __constants__ = ['ignore_index', 'reduction']
    ignore_index: int
    def __init__(self, weight: Optional[Tensor] = None, size_average=None, ignore_index: int = -100,
                 reduce=None, reduction: str = 'mean') -> None:
        super(CrossEntropyLoss, self).__init__(weight, size_average, reduce, reduction)
        self.ignore_index = ignore_index
    def forward(self, input: Tensor, target: Tensor) -> Tensor:
        assert self.weight is None or isinstance(self.weight, Tensor)
        return F.cross_entropy(input, target, weight=self.weight,
                               ignore_index=self.ignore_index, reduction=self.reduction)

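In PyTorch this loss is implemented by the torch.nn.CrossEntropyLoss class; the function F.cross_entropy can also be called directly. The size_average and reduce arguments are deprecated. reduction takes one of three values, mean, sum, or none, which determine the return value $\ell(x, y)$ in the same way as for NLLLoss. The default is mean, which corresponds to the usual way of computing the overall loss.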

$L=\left\{l_{1}, \ldots, l_{N}\right\}$

Verifying that $\text{CrossEntropyLoss}(x, y)=\text{NLLLoss}(\text{LogSoftmax}(x), y)$:

import torch
import torch.nn as nn

# Multi-class classification
m = nn.LogSoftmax(dim=1)
loss_nll_fct = nn.NLLLoss(reduction="mean")
loss_ce_fct = nn.CrossEntropyLoss(reduction="mean")
input_src = torch.tensor([[0.8, 0.9, 0.3], [0.8, 0.9, 0.3], [0.8, 0.9, 0.3], [0.8, 0.9, 0.3]])
target = torch.tensor([1, 1, 0, 0])
# 4 samples, 3 classes
print(input_src.size())
print(target.size())
# NLLLoss expects log-probabilities, so apply LogSoftmax first
output = m(input_src)
loss_nll = loss_nll_fct(output, target)
print(loss_nll.item())
# Check that CrossEntropyLoss on the raw logits gives the same value
loss_ce = loss_ce_fct(input_src, target)
print(loss_ce.item())

torch.Size([4, 3])
torch.Size([4])
0.9475762844085693
0.9475762844085693
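The printed value can be reproduced by hand, as a rough check with intermediate values rounded to four decimals. Every row of the input is $(0.8, 0.9, 0.3)$, so the normalizer is the same for all four samples:

$\log \sum_{j} \exp(x_{n, j})=\log \left(e^{0.8}+e^{0.9}+e^{0.3}\right) \approx \log (6.0350) \approx 1.7976$

With no class weights, $l_{n}=\log \sum_{j} \exp(x_{n, j})-x_{n, y_{n}}$, which gives $1.7976-0.9=0.8976$ for the two samples with target 1 and $1.7976-0.8=0.9976$ for the two samples with target 0. The mean reduction then yields:

$\ell(x, y)=\frac{2 \times 0.8976+2 \times 0.9976}{4}=0.9476$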
