Summary of Common PyTorch Loss Functions for Cross-Modal Retrieval

1. MSE Loss (regression)

Mean squared error: creates a criterion that measures the mean squared error (squared L2 norm) between each element in the input x and the target y.
ℓ(x, y) = L = {l_1, ..., l_N}, with l_n = (x_n − y_n)²; with reduction='mean' the final loss is mean(L).

    import torch
    import torch.nn as nn

    loss = nn.MSELoss()
    input = torch.randn(3, 5, requires_grad=True)
    target = torch.randn(3, 5)
    output = loss(input, target)
    output.backward()

    print(input)
    print(target)
    print(output)

tensor([[ 0.8964,  1.6948,  0.5003,  0.6851, -0.7712],
        [-0.7480, -0.1916,  0.4495, -1.2375, -0.7038],
        [-1.5244,  0.2029, -0.7153, -1.4792,  1.1071]], requires_grad=True)
tensor([[-1.2733,  0.2253,  0.1926, -1.1926,  0.5637],
        [-0.9189,  2.7922,  0.2730,  0.6243, -1.2396],
        [ 1.2193, -0.6027, -0.1948,  0.5456, -0.3350]])
tensor(2.6409, grad_fn=<MseLossBackward>)

2. BCELoss (classification)

torch.nn.BCELoss(weight=None, size_average=None, reduce=None, reduction='mean')
Function: creates a criterion that measures the binary cross entropy between the output and the target.
ℓ_n = −w_n [ y_n · log(x_n) + (1 − y_n) · log(1 − x_n) ]
This is used, for example, for measuring the reconstruction error of an auto-encoder; note that the target y should be a number between 0 and 1.

    m = nn.Sigmoid()
    loss = nn.BCELoss()
    input = torch.randn(3, requires_grad=True)
    target = torch.empty(3).random_(2)
    output = loss(m(input), target)
    output.backward()

    print(m(input))
    print(target)
    print(output)
    
tensor([0.2943, 0.4337, 0.3497], grad_fn=<SigmoidBackward>)
tensor([0., 1., 1.])
tensor(0.7449, grad_fn=<BinaryCrossEntropyBackward>)

2.1 BCEWithLogitsLoss

This criterion combines a Sigmoid layer and BCELoss in one single class, which is more numerically stable than using a plain Sigmoid followed by BCELoss.
ℓ_n = −w_n [ y_n · log σ(x_n) + (1 − y_n) · log(1 − σ(x_n)) ]

    loss = nn.BCEWithLogitsLoss()
    input = torch.randn(3, requires_grad=True)
    target = torch.empty(3).random_(2)
    output = loss(input, target)
    output.backward()

    print(input)
    print(target)
    print(output)
    
tensor([0.1406, 0.4081, 1.5632], requires_grad=True)
tensor([1., 1., 0.])
tensor(0.9628, grad_fn=<BinaryCrossEntropyWithLogitsBackward>)

3. KLDivLoss (distribution matching)

Measures how much one probability distribution differs from another.
torch.nn.KLDivLoss(size_average=None, reduce=None, reduction='mean', log_target=False)
The Kullback-Leibler divergence is a useful distance measure for continuous distributions, and is often useful when performing direct regression over the space of (discretely sampled) continuous output distributions.
Pointwise, l(x, y) = y · (log y − x), where the input x is expected to contain log-probabilities and the target y plain probabilities (with log_target=False); reduction='batchmean' divides the summed loss by the batch size, which matches the mathematical definition of KL divergence.
Reference: https://zhuanlan.zhihu.com/p/339613080
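
A minimal usage sketch (illustration only): the input must contain log-probabilities, the target contains plain probabilities when log_target=False, and reduction='batchmean' matches the mathematical definition of KL divergence.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    # 'batchmean' divides by the batch size and matches the mathematical KL definition.
    kl_loss = nn.KLDivLoss(reduction='batchmean', log_target=False)

    # input: log-probabilities from the model; target: a probability distribution.
    input = F.log_softmax(torch.randn(3, 5, requires_grad=True), dim=1)
    target = F.softmax(torch.randn(3, 5), dim=1)

    output = kl_loss(input, target)
    output.backward()
    print(output)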

4.CrossEntropyLoss

torch.nn.CrossEntropyLoss(weight=None, size_average=None, ignore_index=-100, reduce=None, reduction='mean')
This criterion combines LogSoftmax and NLLLoss in a single class. It is useful when training a classification problem with C classes. If provided, the optional argument weight should be a 1-D tensor assigning a weight to each class; this is particularly useful when the training set is unbalanced. The input is expected to contain raw, unnormalized scores for each class.
loss(x, class) = −log( exp(x[class]) / Σ_j exp(x[j]) ) = −x[class] + log( Σ_j exp(x[j]) )

    loss = nn.CrossEntropyLoss()
    input = torch.randn(3, 5, requires_grad=True)
    target = torch.empty(3, dtype=torch.long).random_(5)
    output = loss(input, target)
    output.backward()

    print(input)
    print(target)
    print(output)

tensor([[-1.3169,  0.6902, -0.3976, -0.1056,  1.6268],
        [-0.1469, -0.8665, -0.7510, -0.5994,  0.0559],
        [-0.5486, -0.6443, -0.1930,  0.7109,  0.0054]], requires_grad=True)
tensor([2, 3, 4])
tensor(1.9986, grad_fn=<NllLossBackward>)

5.TripletMarginLoss

torch.nn.TripletMarginLoss(margin=1.0, p=2.0, eps=1e-06, swap=False, size_average=None, reduce=None, reduction='mean')
Creates a criterion that measures the triplet loss given input tensors x1, x2, x3 and a margin greater than 0. It is used for measuring relative similarity between samples. A triplet is composed of a, p and n (anchor, positive example and negative example, respectively). The shapes of all input tensors should be (N, D).
L(a, p, n) = max{ d(a_i, p_i) − d(a_i, n_i) + margin, 0 }, where d(x_i, y_i) = ‖x_i − y_i‖_p

    triplet_loss = nn.TripletMarginLoss(margin=1.0, p=2)
    anchor = torch.randn(2, 3, requires_grad=True)
    positive = torch.randn(2, 3, requires_grad=True)
    negative = torch.randn(2, 3, requires_grad=True)
    output = triplet_loss(anchor, positive, negative)
    output.backward()

    print(anchor)
    print(positive)
    print(negative)
    print(output)
tensor([[ 0.2258, -0.6545, -1.5043],
        [ 1.0083,  0.6198, -0.1240]], requires_grad=True)
tensor([[ 1.8021,  1.9506, -0.6078],
        [-0.1537, -0.2082, -0.6502]], requires_grad=True)
tensor([[-0.9203,  0.4674, -0.6659],
        [-1.3081, -1.0276,  0.9357]], requires_grad=True)
tensor(1.1822, grad_fn=<MeanBackward0>)

torch.nn.TripletMarginWithDistanceLoss(*, distance_function=None, margin=1.0, swap=False, reduction='mean')
A variant of TripletMarginLoss that accepts a custom distance function for computing the anchor-positive and anchor-negative distances instead of the default pairwise p-norm, as sketched below.
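
A small sketch of plugging in a custom distance function (illustration only; cosine distance is just an arbitrary example choice):

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    # Cosine distance instead of the default pairwise p-norm distance.
    cosine_distance = lambda x, y: 1.0 - F.cosine_similarity(x, y)
    triplet_loss = nn.TripletMarginWithDistanceLoss(distance_function=cosine_distance, margin=0.5)

    anchor = torch.randn(4, 128, requires_grad=True)
    positive = torch.randn(4, 128, requires_grad=True)
    negative = torch.randn(4, 128, requires_grad=True)

    output = triplet_loss(anchor, positive, negative)
    output.backward()
    print(output)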

6.NLLLoss

torch.nn.NLLLoss(weight=None, size_average=None, ignore_index=-100, reduce=None, reduction='mean')
Negative log likelihood loss. It is useful for training a classification problem with C classes. If provided, the optional argument weight should be a 1-D tensor assigning a weight to each class; this is particularly useful when the training set is unbalanced.
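
A minimal sketch (illustration only): NLLLoss expects log-probabilities as input, so it is usually paired with nn.LogSoftmax; LogSoftmax followed by NLLLoss is equivalent to the CrossEntropyLoss shown above.

    import torch
    import torch.nn as nn

    m = nn.LogSoftmax(dim=1)
    loss = nn.NLLLoss()

    # input: raw scores for C=5 classes; target: class indices in [0, C-1].
    input = torch.randn(3, 5, requires_grad=True)
    target = torch.tensor([1, 0, 4])

    output = loss(m(input), target)
    output.backward()
    print(output)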

Adversarial cross-media retrieval (ACMR): Cross-modal retrieval aims to enable flexible retrieval experience across different modalities (e.g., texts vs. images). The core of cross-modal retrieval research is to learn a common subspace where the items of different modalities can be directly compared to each other. In this paper, we present a novel Adversarial Cross-Modal Retrieval (ACMR) method, which seeks an effective common subspace based on adversarial learning. Adversarial learning is implemented as an interplay between two processes. The first process, a feature projector, tries to generate a modality-invariant representation in the common subspace and to confuse the other process, the modality classifier, which tries to discriminate between different modalities based on the generated representation. We further impose triplet constraints on the feature projector in order to minimize the gap among the representations of all items from different modalities with the same semantic labels, while maximizing the distances among semantically different images and texts. Through the joint exploitation of the above, the underlying cross-modal semantic structure of multimedia data is better preserved when this data is projected into the common subspace. Comprehensive experimental results on four widely used benchmark datasets show that the proposed ACMR method is superior in learning effective subspace representation and that it significantly outperforms the state-of-the-art cross-modal retrieval methods.
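
A heavily simplified sketch of how the losses summarized above can be combined in the spirit of ACMR (illustration only, with made-up module names such as feature_projector and modality_classifier; this is not the authors' released code): the modality classifier learns to tell image features from text features with a binary cross-entropy objective, while the feature projector is trained with a triplet term plus an adversarial term that tries to fool the classifier.

    import torch
    import torch.nn as nn

    # Hypothetical toy modules, for illustration only.
    feature_projector = nn.Sequential(nn.Linear(512, 128), nn.ReLU(), nn.Linear(128, 64))
    modality_classifier = nn.Linear(64, 1)  # predicts: image (1) vs. text (0)

    bce_logits = nn.BCEWithLogitsLoss()
    triplet = nn.TripletMarginLoss(margin=1.0, p=2)

    # Fake batch: image anchors, matching texts (positives), non-matching texts (negatives).
    img = torch.randn(8, 512)
    txt_pos = torch.randn(8, 512)
    txt_neg = torch.randn(8, 512)

    img_emb = feature_projector(img)
    pos_emb = feature_projector(txt_pos)
    neg_emb = feature_projector(txt_neg)

    # Modality classifier loss: discriminate images (label 1) from texts (label 0).
    logits = torch.cat([modality_classifier(img_emb), modality_classifier(pos_emb)]).squeeze(1)
    labels = torch.cat([torch.ones(8), torch.zeros(8)])
    d_loss = bce_logits(logits, labels)

    # Feature projector loss: preserve semantics with a triplet term and confuse the
    # classifier with an adversarial term (flipped modality labels).
    adv_loss = bce_logits(logits, 1.0 - labels)
    g_loss = triplet(img_emb, pos_emb, neg_emb) + adv_loss

    print(d_loss, g_loss)

In actual training the two losses would be optimized alternately with separate optimizers (and the classifier input detached when updating it); the sketch only shows how the loss terms are composed.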