ML Basics (16): Loss Functions

LightYoungLee

Article link: https://blog.csdn.net/weixin_37688445/article/details/117917746

General

contrastive loss

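Contrastive loss pulls similar samples as close together as possible and pushes dissimilar samples as far apart as possible. The formula is shown below, where $y_n = 1$ for a similar pair and $y_n = 0$ for a dissimilar pair, and $d_n$ is the distance between the two samples of the $n$-th pair:

$L = \frac{1}{N}\sum_{n=1}^{N}\left[ y_n d_n^2 + (1 - y_n)\max(margin - d_n, 0)^2 \right]$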
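
To make the two terms of the formula concrete, here is a minimal sketch in plain Python. The function name `contrastive_loss`, the argument names, and the default margin of 1.0 are assumptions chosen for illustration; the article does not prescribe them. It assumes the pair distances have already been computed.

```python
def contrastive_loss(distances, labels, margin=1.0):
    """Mean contrastive loss over N pairs (illustrative sketch).

    distances: pair distances d_n, non-negative floats.
    labels: y_n, 1 for a similar pair and 0 for a dissimilar pair.
    margin: assumed default of 1.0, not a value taken from the article.
    """
    if len(distances) != len(labels):
        raise ValueError("distances and labels must have the same length")
    total = 0.0
    for d, y in zip(distances, labels):
        # Similar pairs are penalized by their squared distance.
        similar_term = y * d ** 2
        # Dissimilar pairs are penalized only while closer than the margin.
        dissimilar_term = (1 - y) * max(margin - d, 0.0) ** 2
        total += similar_term + dissimilar_term
    return total / len(distances)


# A close similar pair, a close dissimilar pair, and a far dissimilar pair.
print(contrastive_loss([0.5, 0.5, 2.0], [1, 0, 0], margin=1.0))
```

In this example the far dissimilar pair (distance 2.0, beyond the margin) contributes nothing, while the other two pairs each contribute 0.25 before averaging.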

triplet loss

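As the name suggests, the input to this loss function has three parts: the anchor, the positive example, and the negative example. The core idea of triplet loss consists of the following three parts:

The larger the difference between anchor and negative, the better
The smaller the difference between anchor and positive, the better
The larger the difference between positive and negative, the better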

Based on these three ideas, the triplet loss formula can be written as follows, where anc, pos and neg are the model's representations of anchor, positive and negative, that is, $anc = f(anchor)$, $pos = f(positive)$, $neg = f(negative)$.
$loss = \begin{cases} 2\left\| pos - anc \right\|^2 - \left\| pos - neg \right\|^2 - \left\| anc - neg \right\|^2 + margin, & \text{if } label = 1 \\ 2\left\| neg - anc \right\|^2 - \left\| neg - pos \right\|^2 - \left\| anc - pos \right\|^2 + margin, & \text{if } label = 0 \end{cases}$

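The formula means the following:

If label is 1: the smaller the difference between pos and anc the better, the larger the difference between pos and neg the better, and the larger the difference between anc and neg the better.
If label is 0: the smaller the difference between neg and anc the better, the larger the difference between pos and neg the better, and the larger the difference between anc and pos the better.
The margin is meant to make learning harder: without it, the model could easily push anc, pos and neg all to 0, which would also make the loss small.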
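
The two cases of the formula can be checked by hand with a small plain-Python sketch. The helper names `squared_distance` and `triplet_loss_value`, the 2-D vectors, and the margin of 1.0 are all hypothetical values chosen for illustration. Note that the formula as written has no `max(0, ·)` clamp, so the value can be negative and the margin shifts every value by the same constant.

```python
def squared_distance(u, v):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def triplet_loss_value(pos, neg, anc, label, margin):
    """Evaluate the piecewise triplet loss formula for one example."""
    d_pos_anc = squared_distance(pos, anc)
    d_pos_neg = squared_distance(pos, neg)
    d_anc_neg = squared_distance(anc, neg)
    if label == 1:
        # Pull pos toward anc; push neg away from both.
        return 2 * d_pos_anc - d_pos_neg - d_anc_neg + margin
    # label == 0: pull neg toward anc; push pos away from both.
    return 2 * d_anc_neg - d_pos_neg - d_pos_anc + margin


# Hypothetical embeddings: pos is close to anc, neg is far from both.
anc, pos, neg = [0.0, 0.0], [1.0, 0.0], [0.0, 3.0]
print(triplet_loss_value(pos, neg, anc, label=1, margin=1.0))  # 2*1 - 10 - 9 + 1
print(triplet_loss_value(pos, neg, anc, label=0, margin=1.0))  # 2*9 - 10 - 1 + 1

# If all three embeddings collapse to the same point, only the margin remains.
zero = [0.0, 0.0]
print(triplet_loss_value(zero, zero, zero, label=1, margin=1.0))
```

With these vectors the squared distances are 1 (pos to anc), 10 (pos to neg) and 9 (anc to neg), so the label = 1 case gives a much lower value than the label = 0 case, matching the intent described above.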

A TensorFlow implementation of triplet loss is given below:

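import tensorflow as tf

def triplet_loss(self, pos, neg, anc, label):
    # Method of a model class that defines self.triplet_loss_margin.
    # pos, neg, anc: embeddings with the feature dimension last.
    # label is assumed to be a float tensor with a trailing dimension of 1,
    # so that it lines up with the expanded losses below.

    # Squared Euclidean distance between each pair of embeddings.
    d_pos_anc = tf.reduce_sum(tf.square(pos - anc), axis=-1)
    d_pos_neg = tf.reduce_sum(tf.square(pos - neg), axis=-1)
    d_anc_neg = tf.reduce_sum(tf.square(anc - neg), axis=-1)

    # label == 1: pull pos toward anc; label == 0: pull neg toward anc.
    loss1 = tf.expand_dims(2 * d_pos_anc - d_pos_neg - d_anc_neg + self.triplet_loss_margin, axis=-1)
    loss2 = tf.expand_dims(2 * d_anc_neg - d_pos_neg - d_pos_anc + self.triplet_loss_margin, axis=-1)
    # Pick the per-example loss according to the label.
    loss = tf.where(tf.equal(label, 1.0), loss1, loss2)

    return loss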

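Recommendation-system related

pairwise hinge loss

This loss measures the difference between positive and negative samples in a pairwise setting. In the formula below, $margin$ is a preset threshold, $u$ is the input query, $d^+$ is the positive sample, $d^-$ is the negative sample, and $\langle \cdot, \cdot \rangle$ is the similarity between two vectors. The formula means that the loss drops to 0 only when the input query is sufficiently similar to the positive sample; otherwise, the less similar the query is to the positive sample, or the more similar it is to the negative sample, the larger the loss becomes.

loss=max(0,margi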