多标签多分类损失计算 - BCEWithLogitsLoss详解

想胖的壮壮

于 2024-07-27 17:33:11 发布

阅读量995

点赞数 18

文章标签：分类数据挖掘人工智能

本文链接：https://blog.csdn.net/weixin_47552266/article/details/140738577

版权

`nn.BCEWithLogitsLoss`

在深度学习的多标签多分类任务中，nn.BCEWithLogitsLoss 是一个常用的损失函数。它结合了二元交叉熵损失（Binary Cross Entropy Loss）和 sigmoid 激活函数，能够有效地计算每个类别的概率并与目标值进行比较，从而计算损失。本文将详细介绍 nn.BCEWithLogitsLoss 的计算过程、各个参数及其作用，并通过具体的代码示例加以说明。

计算过程

nn.BCEWithLogitsLoss 的计算过程可以分为以下几个步骤：

Sigmoid Activation: 对输入的 logits 进行 sigmoid 激活，使其变为 [0, 1] 之间的概率值。
Binary Cross Entropy: 使用二元交叉熵损失公式计算损失。

给定输入 logits $x$ 和目标值 $y$ ，nn.BCEWithLogitsLoss 的计算公式为：

$\text{BCEWithLogitsLoss}(x, y) = - \frac{1}{N} \sum_{i=1}^N \left[ y_i \log(\sigma(x_i)) + (1 - y_i) \log(1 - \sigma(x_i)) \right]$

其中， $\sigma(x)$ 表示 sigmoid 激活函数：

$\sigma(x) = \frac{1}{1 + e^{-x}}$

参数详解

nn.BCEWithLogitsLoss 提供了多种参数，以灵活地适应不同的应用场景：

weight (Tensor, optional): 每个样本的权重。默认值为 None。这个参数用于在计算损失时对每个样本进行加权。
size_average (bool, optional, deprecated): 如果为 True，损失将会被平均，否则会被求和。默认值为 True。注意，这个参数已经被弃用，推荐使用 reduction 参数。
reduce (bool, optional, deprecated): 如果为 True，损失将会被求和。默认值为 True。注意，这个参数已经被弃用，推荐使用 reduction 参数。
reduction (str, optional): 指定应用于输出的归约方式，可以是 'none'、'mean' 或 'sum'。默认值为 'mean'。这个参数决定了损失是逐元素计算、求平均还是求和。

示例代码

以下是详细说明每个参数作用的示例代码。

import torch

# 定义 Sigmoid 激活函数
def sigmoid(x):
    return 1 / (1 + torch.exp(-x))

# 定义二元交叉熵损失函数
def binary_cross_entropy(pred, target):
    epsilon = 1e-12  # 避免 log(0) 的数值问题
    pred = torch.clamp(pred, epsilon, 1. - epsilon)
    return - (target * torch.log(pred) + (1 - target) * torch.log(1 - pred))

# 定义加权损失函数
def weighted_bce_loss(logits, target, weight=None, reduction='mean'):
    # 计算 Sigmoid 激活
    probabilities = sigmoid(logits)
    # 计算二元交叉熵损失
    bce = binary_cross_entropy(probabilities, target)
    
    # 如果设置了权重，则加权
    if weight is not None:
        bce = bce * weight
    
    # 根据 reduction 参数进行归约
    if reduction == 'mean':
        return bce.mean()
    elif reduction == 'sum':
        return bce.sum()
    elif reduction == 'none':
        return bce
    else:
        raise ValueError(f"Invalid reduction type: {reduction}")

# 模拟数据
logits = torch.tensor([[0.5, -1.2, 3.0], [1.5, 2.0, -0.5]], requires_grad=True)
targets = torch.tensor([[1, 0, 1], [0, 1, 0]], dtype=torch.float32)

# 示例1：默认情况下使用 nn.BCEWithLogitsLoss
loss_default = weighted_bce_loss(logits, targets)
print(f'Default Loss: {loss_default.item()}')

# 示例2：使用 weight 参数
weights = torch.tensor([0.7, 0.2, 0.1])  # 为每个类别设置不同的权重
loss_weighted = weighted_bce_loss(logits, targets, weight=weights)
print(f'Weighted Loss: {loss_weighted.item()}')

# 示例3：使用 reduction='none' 参数
loss_no_reduction = weighted_bce_loss(logits, targets, reduction='none')
print(f'No Reduction Loss: {loss_no_reduction}')

# 示例4：使用 reduction='sum' 参数
loss_sum_reduction = weighted_bce_loss(logits, targets, reduction='sum')
print(f'Sum Reduction Loss: {loss_sum_reduction.item()}')

# 示例5：使用 reduction='mean' 参数
loss_mean_reduction = weighted_bce_loss(logits, targets, reduction='mean')
print(f'Mean Reduction Loss: {loss_mean_reduction.item()}')

在多标签多分类任务中的应用

在多标签多分类任务中，每个样本可能属于多个类别。例如，对于一个图片数据集，每张图片可能同时包含多个物体。我们可以使用 nn.BCEWithLogitsLoss 来处理这种情况。

以下是一个具体的示例，展示了如何在多标签多分类任务中应用 nn.BCEWithLogitsLoss：

import torch
import torch.nn as nn
import torch.optim as optim

# 简单的神经网络模型
class SimpleNN(nn.Module):
    def __init__(self, input_dim, output_dim):
        super(SimpleNN, self).__init__()
        self.fc = nn.Linear(input_dim, output_dim)
    
    def forward(self, x):
        return self.fc(x)

# 模拟数据
batch_size = 4
input_dim = 5
output_dim = 3

inputs = torch.randn(batch_size, input_dim)
targets = torch.randint(0, 2, (batch_size, output_dim)).float()

# 创建模型、损失函数和优化器
model = SimpleNN(input_dim, output_dim)
criterion = nn.BCEWithLogitsLoss()
optimizer = optim.SGD(model.parameters(), lr=0.01)

# 前向传播
outputs = model(inputs)
loss = criterion(outputs, targets)

print(f'Initial loss: {loss.item()}')

# 反向传播和优化
optimizer.zero_grad()
loss.backward()
optimizer.step()

# 打印更新后的损失
outputs = model(inputs)
loss = criterion(outputs, targets)
print(f'Updated loss: {loss.item()}')

总结表格

通过以上示例，我们可以看到 nn.BCEWithLogitsLoss 在不同参数设置下的具体应用。每个参数都能显著影响损失的计算方式，从而适应不同的任务需求。

参数	作用	示例结果
`weight`	对每个类别设置不同的权重，用于加权计算损失	Weighted Loss: 0.1967
`size_average`	如果为 `True`，损失将会被平均，否则会被求和（已弃用，使用 `reduction` 参数）	-
`reduce`	如果为 `True`，损失将会被求和（已弃用，使用 `reduction` 参数）	-
`reduction`	指定损失的归约方式，可以是 `'none'`、`'mean'` 或 `'sum'`	No Reduction Loss: tensor([[0.4741, 0.2654, 0.0474], [0.1687, 0.0613, 0.1978]]), Sum Reduction Loss: 1.2147, Mean Reduction Loss: 0.2024

通过这个代码实现，您可以更好地理解 nn.BCEWithLogitsLoss 的计算过程及其各个参数的作用。在实际应用中，根据具体任务需求选择合适的参数设置，可以有效地提高模型的性能。

想胖的壮壮

关注

18
点赞
踩
16

收藏

觉得还不错? 一键收藏
0
评论
多标签多分类损失计算 - BCEWithLogitsLoss详解

通过以上示例，我们可以看到在不同参数设置下的具体应用。每个参数都能显著影响损失的计算方式，从而适应不同的任务需求。参数作用示例结果weight对每个类别设置不同的权重，用于加权计算损失如果为True，损失将会被平均，否则会被求和（已弃用，使用reduction参数）reduce如果为True，损失将会被求和（已弃用，使用reduction参数）reduction指定损失的归约方式，可以是'none''mean'或'sum'通过这个代码实现，您可以更好地理解的计算过程及其各个参数的作用。
复制链接

扫一扫