Crossentropyloss与 BCELoss差异

夏日清风有你

已于 2023-04-03 19:26:59 修改

阅读量322

点赞数

分类专栏： PyTorch 文章标签：深度学习人工智能 python

于 2023-02-14 17:17:49 首次发布

原文链接：https://blog.csdn.net/loki2018/article/details/127210390

版权

PyTorch 专栏收录该内容

50 篇文章 1 订阅

订阅专栏

文章详细解释了在PyTorch中，Crossentropyloss用于多分类任务，其输入是经过softmax的预测概率和long类型的标签，而BCELoss（包括BCEWithLogitsLoss）适用于二分类，需要对预测值先应用sigmoid并使用float类型的标签。同时，文中给出了手动实现这两种损失函数的代码示例。

摘要由CSDN通过智能技术生成

Crossentropyloss与 BCELoss差异

在这里插入图片描述

python报错：

1only batches of spatial targets supported (non-empty 3D tensors) but got targets of size

因为在使用Crossentropyloss作为损失函数时，output=net(input)的output应该是[batchsize, channel, height, weight]，而label则是[batchsize, height, weight]，label是单通道灰度图.

而在BCELoss中，两者都是[batchsize, channel, height, weight]。

报错是因为label本应该是一维的，我在保存的时候处理成了3维。

BCELoss

在这里插入图片描述

主要用于计算标签只有1或者0时的二分类损失，标签和预测值是一一对应的。需要注意的是，通过nn.BCEloss来计算损失前，需要对预测值进行一次sigmoid计算。sigmoid函数会将预测值映射到0-1之间。
如果觉得手动加sigmoid函数麻烦，可以直接调用nn.BCEwithlogitsloss。

使用nn.BCEloss计算损失

import torch
import torch.nn as nn
import torch.nn.functional as F

loss = nn.BCELoss(reduction="none")
target = torch.tensor([1,0,1], dtype=torch.float32)
predict = torch.tensor([0.8, 0.2, 0.3], dtype=torch.float32)
loss(F.sigmoid(predict), target)

#结果计算为：
tensor([0.3711, 0.7981, 0.5544])

手动实现nn.BCEloss

def myBceloss(predict, target, reduction="none"):
    predict = F.sigmoid(predict)
    if reduction == "none":
        return -(target*torch.log(predict) + (1-target)*torch.log(1-predict))     
myBceloss(predict, target)

#结果计算为：
tensor([0.3711, 0.7981, 0.5544])

Crossentropyloss

在这里插入图片描述
用于计算多分类任务，一个标签可能对应了预测的多个概率，例如一个任务包含了C个类别，那么预测值就有C 个。

使用nn.CrossEntropyLoss计算损失

（已经使用了softmax）

loss2 = nn.CrossEntropyLoss(reduction="none")
target2 = torch.tensor([0, 1, 2])
predict2 = torch.tensor([[0.9, 0.2, 0.8], [0.5, 0.2, 0.4], [0.4, 0.2, 0.9]])
loss2(predict2, target2)

#结果计算为：
tensor([0.8761, 1.2729, 0.7434])

手动实现nn.CrossEntropyLoss

def myCrossEntropyloss(target, predict, reduction="none"):
    if reduction == "none":
        predict = F.softmax(predict, dim=1)
        n = torch.arange(predict.shape[0])
        predict = predict[n, target]
        return -torch.log(predict)
myCrossEntropyloss(target2, predict2)
#结果计算为：
tensor([0.8761, 1.2729, 0.7434])

注意

BCEWithLogitsLoss 要求它的目标是一个float 张量，而不是long。
Crossentropyloss 的标签要求是long类型
通过dtype=torch.float32指定t张量的类型

import torch
t = torch.tensor([[1, 0, 1, 1]], dtype=torch.float32).T
p = torch.rand(4,1)
loss_fn = torch.nn.BCEWithLogitsLoss()
print(loss_fn(p, t))

原文链接：https://blog.csdn.net/loki2018/article/details/127210390

夏日清风有你

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
复制链接

分享到 QQ

分享到新浪微博

扫一扫

专栏目录