pytorch cross-entropy loss
BCELoss vs BCEWithLogitsLoss
BCELoss expects its input to have already been passed through a sigmoid.
BCEWithLogitsLoss simply fuses the Sigmoid and BCELoss into a single step.
import numpy as np
import torch

input = torch.tensor(np.arange(3) / 3).reshape(3, 1)    # raw scores (logits)
input2 = torch.nn.Sigmoid()(input)                       # probabilities for BCELoss
loss = torch.nn.BCELoss()
target = torch.tensor([[0], [1], [1]]).to(torch.double)
print(loss(input2, target))
loss = torch.nn.BCEWithLogitsLoss()                      # applies the sigmoid internally
print(loss(input, target))
# output
#tensor(0.5493, dtype=torch.float64)
#tensor(0.5493, dtype=torch.float64)
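The value 0.5493 can be reproduced by hand from the binary cross-entropy formula -[y*log(p) + (1-y)*log(1-p)], averaged over the batch (a minimal check reusing input and target from above):
# manual BCE: p = sigmoid(input), loss = mean of -[y*log(p) + (1-y)*log(1-p)]
p = torch.sigmoid(input)
manual = -(target * torch.log(p) + (1 - target) * torch.log(1 - p)).mean()
print(manual)  # tensor(0.5493, dtype=torch.float64)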
nll_loss and cross_entropy loss
In the same way as above, CrossEntropyLoss fuses Softmax–Log–NLLLoss into a single step; note that here an extra log layer is needed on top of the softmax, because NLLLoss expects log-probabilities rather than probabilities.
input = torch.randn(3, 3)                                # random logits, so values differ per run
input2 = torch.log(torch.nn.Softmax(dim=1)(input))       # log-probabilities for NLLLoss
loss = torch.nn.NLLLoss()
target = torch.tensor([0, 2, 1])
print(loss(input2, target))
loss = torch.nn.CrossEntropyLoss()                       # softmax + log + NLLLoss in one step
print(loss(input, target))
# output
#tensor(1.0774)
#tensor(1.0774)
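The same equivalence can be checked with torch.log_softmax, which is the numerically stabler way to get log-probabilities than log(softmax(x)), and by picking out each row's target log-probability by hand (a small sketch reusing input and target from above; the exact value depends on the random input):
logp = torch.log_softmax(input, dim=1)                   # stable equivalent of log(softmax(x))
manual = -logp[torch.arange(3), target].mean()           # NLLLoss = mean negative target log-prob
print(torch.nn.NLLLoss()(logp, target), manual)          # both match CrossEntropyLoss above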
tensorflow/keras cross-entropy loss
Here we only discuss the multi-class case.
sparse_categorical_crossentropy vs sparse_softmax_cross_entropy_with_logits
These functions correspond to PyTorch's cross_entropy.
In TensorFlow, "logits" means that the function applies the softmax or sigmoid normalization to the raw logit values itself; it is also a hint that you should not apply sigmoid or softmax to the network output yourself, since those steps can be computed more efficiently (and more stably) inside the function.
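A small illustration of that convention (a sketch, assuming TensorFlow 2.x; the numbers are made up for the example): SparseCategoricalCrossentropy gives the same result whether it is fed raw logits with from_logits=True or softmax probabilities with the default from_logits=False.
import tensorflow as tf
logits = tf.constant([[1.4, 0.0, 0.3], [0.3, 1.8, 0.0]])   # raw network outputs (example values)
labels = tf.constant([0, 2])
# let the loss apply the softmax internally ...
print(tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True)(labels, logits).numpy())
# ... or apply it yourself and use the default from_logits=False: same value
print(tf.keras.losses.SparseCategoricalCrossentropy()(labels, tf.nn.softmax(logits)).numpy())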
In torch:
# shape (N, C, d1) = (2 samples, 4 classes, 2 positions); the class dimension is dim 1
outputs = torch.tensor([[[1.4, 1.2], [0.0, 0.0], [0.3, 1.8], [0.3, 0.0]],
                        [[1.4, 1.2], [0.0, 0.0], [0.3, 1.8], [0.3, 0.0]]]).to(torch.float32)
labels = torch.tensor([[0, 0], [0, 0]])                  # shape (N, d1)
loss = torch.nn.CrossEntropyLoss()(outputs, labels)
loss, outputs.shape, labels.shape
# output
#(tensor(0.9396), torch.Size([2, 4, 2]), torch.Size([2, 2]))
In tensorflow/keras, tf.nn.sparse_softmax_cross_entropy_with_logits likewise takes raw logits, but it expects the class dimension to come last and returns a per-element loss rather than a single scalar.
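A sketch of the matching TensorFlow computation (assuming TensorFlow 2.x): the torch outputs of shape (2, 4, 2) with classes at dim 1 become logits of shape (2, 2, 4) with classes last, and the per-element losses have to be averaged to get back the scalar 0.9396.
import tensorflow as tf
# same numbers as the torch snippet, with the class dimension moved to the end
logits = tf.constant([[[1.4, 0.0, 0.3, 0.3], [1.2, 0.0, 1.8, 0.0]],
                      [[1.4, 0.0, 0.3, 0.3], [1.2, 0.0, 1.8, 0.0]]])
labels = tf.constant([[0, 0], [0, 0]])
per_elem = tf.nn.sparse_softmax_cross_entropy_with_logits(labels=labels, logits=logits)
print(per_elem.shape, tf.reduce_mean(per_elem).numpy())   # (2, 2) ~0.9396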