Paper Quick Look [2019-01-17]: Support Vector Guided Softmax Loss for Face Recognition

A paper on improving the loss function for face recognition.
Paper: https://128.84.21.199/abs/1812.11317
Authors' GitHub (not yet complete): https://github.com/xiaoboCASIA/SV-X-Softmax
A blogger has posted a full Chinese translation of the paper (everything except the experiments); if you prefer to read it in Chinese, see
calvinpaean: Support Vector Guided Softmax Loss for Face Recognition 论文学习

Abstract

Face recognition has witnessed significant progresses due to the advances of deep convolutional neural networks (CNNs), the central challenge of which, is feature discrimination. To address it, one group tries to exploit mining-based strategies (e.g., hard example mining and focal loss) to focus on the informative examples. The other group devotes to designing margin-based loss functions (e.g., angular, additive and additive angular margins) to increase the feature margin from the perspective of ground truth class. Both of them have been well-verified to learn discriminative features. However, they suffer from either the ambiguity of hard examples or the lack of discriminative power of other classes. In this paper, we design a novel loss function, namely support vector guided softmax loss (SV-Softmax), which adaptively emphasizes the mis-classified points (support vectors) to guide the discriminative features learning. So the developed SV-Softmax loss is able to eliminate the ambiguity of hard examples as well as absorb the discriminative power of other classes, and thus results in more discrimiantive features. To the best of our knowledge, this is the first attempt to inherit the advantages of mining-based and margin-based losses into one framework. Experimental results on several benchmarks have demonstrated the effectiveness of our approach over state-of-the-arts.

The authors analyze the two families of loss functions used to make features more discriminative in face recognition: mining-based and margin-based losses. Both have their limitations, so the authors propose SV-Softmax, a loss that combines the two approaches.

Main contributions

  1. We propose a novel SV-Softmax loss, which eliminates the ambiguity of hard examples as well as absorbs the discriminative power of other classes by focusing on support vectors. To the best of our knowledge, this is the first attempt to semantically fuse the mining-based and margin-based losses into one framework.
  2. We deeply analyze the relations of our SV-Softmax loss to the current mining-based and margin-based losses, and further develop an improved version SV-X-Softmax loss to enhance the feature discrimiantion.
    The authors carefully analyze the differences and connections between the proposed loss and focal loss / ArcFace loss; this analysis is done well and is the theoretical highlight of the paper.
  3. We conduct extensive experiments on the benchmarks of LFW [8], MegaFace Challenge [9, 16] and Trillion Pairs Challenge, which have verified the superiority of our new approach over the baseline Softmax loss, the mining-based Softmax losses, the margin-based Soft-max losses, and their naive fusions.
    The authors run extensive experiments and describe the experimental details at length.

SV-Softmax loss

A few formulas and figures to illustrate the authors' idea:
Softmax loss
$$\mathcal{L}_1=-\log\frac{e^{s\cos(\theta_{w_y,x})}}{e^{s\cos(\theta_{w_y,x})}+\sum_{k\neq y}^{K}e^{s\cos(\theta_{w_k,x})}}$$
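Below is a minimal NumPy sketch (my own illustration, not the authors' code) of this scaled cosine softmax loss for a single sample, assuming the feature and the class weights are L2-normalized so that the logits become $s\cos(\theta)$; the scale $s=30$ is just a typical placeholder value.

```python
import numpy as np

def cosine_softmax_loss(x, W, y, s=30.0):
    """Softmax loss on scaled cosine logits for a single sample.
    x: (d,) feature; W: (K, d) class weights; y: ground-truth class index."""
    x = x / np.linalg.norm(x)                          # L2-normalize the feature
    W = W / np.linalg.norm(W, axis=1, keepdims=True)   # L2-normalize each class weight w_k
    cos = W @ x                                        # cos(theta_{w_k, x}) for every class k
    logits = s * cos
    logits = logits - logits.max()                     # numerical stability
    p = np.exp(logits) / np.exp(logits).sum()          # softmax probabilities p_k
    return -np.log(p[y])

rng = np.random.default_rng(0)
x, W, y = rng.normal(size=128), rng.normal(size=(10, 128)), 3
print(cosine_softmax_loss(x, W, y))
```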
Mining-based Softmax
$$\mathcal{L}_2=-g(p_y)\log\frac{e^{s\cos(\theta_{w_y,x})}}{e^{s\cos(\theta_{w_y,x})}+\sum_{k\neq y}^{K}e^{s\cos(\theta_{w_k,x})}}$$
Mining-based softmax differs from plain softmax only by the extra factor $g(p_y)$, which puts more training emphasis on hard samples. In focal loss, $g(p_y)=(1-p_y)^\gamma$; in HM-Softmax, $g(p_y)=1$ for hard samples and $g(p_y)=0$ otherwise.
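A small sketch of these two re-weighting functions $g(p_y)$; the hard-sample threshold in `g_hard_mining` is a hypothetical placeholder, since HM-Softmax selects hard samples by its own criterion.

```python
import numpy as np

def g_focal(p_y, gamma=2.0):
    # Focal-loss style factor: down-weights easy samples whose p_y is close to 1.
    return (1.0 - p_y) ** gamma

def g_hard_mining(p_y, hard_threshold=0.5):
    # HM-Softmax style indicator: 1 for hard samples, 0 otherwise.
    # The threshold here is only a stand-in for however hard samples are selected.
    return 1.0 if p_y < hard_threshold else 0.0

# The mining-based loss is then g(p_y) * (-log p_y) instead of plain -log p_y.
p_y = 0.9
print(g_focal(p_y) * -np.log(p_y), g_hard_mining(p_y) * -np.log(p_y))
```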
Margin-based loss
Following the improvements made by ArcFace, SphereFace, A-Softmax and related losses:
$$\mathcal{L}_3=-\log\frac{e^{sf(m,\theta_{w_y,x})}}{e^{sf(m,\theta_{w_y,x})}+\sum_{k\neq y}^{K}e^{s\cos(\theta_{w_k,x})}}$$
In the above, $f(m,\theta_{w_y,x})$ is a carefully designed margin function.
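For concreteness, the commonly used margin functions $f(m,\theta)$ can be sketched as follows (my own summary, not code from the paper): SphereFace/A-Softmax multiplies the angle, CosFace subtracts from the cosine, and ArcFace adds to the angle.

```python
import numpy as np

def f_sphereface(theta, m=4):
    # A-Softmax / SphereFace: multiplicative angular margin cos(m * theta)
    # (real implementations use a piecewise monotonic extension of this).
    return np.cos(m * theta)

def f_cosface(theta, m=0.35):
    # CosFace / AM-Softmax: additive cosine margin cos(theta) - m
    return np.cos(theta) - m

def f_arcface(theta, m=0.5):
    # ArcFace: additive angular margin cos(theta + m)
    return np.cos(theta + m)

theta = np.arccos(0.8)   # angle between the feature and its ground-truth class weight
print(f_sphereface(theta), f_cosface(theta), f_arcface(theta))
```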
[Figure 1 from the paper]

Naive Mining-Margin Softmax loss

$$\mathcal{L}_4=-g(p_y)\log\frac{e^{sf(m,\theta_{w_y,x})}}{e^{sf(m,\theta_{w_y,x})}+\sum_{k\neq y}^{K}e^{s\cos(\theta_{w_k,x})}}$$
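A self-contained sketch of this naive fusion, using an ArcFace-style $f$ and a focal-style $g$ as examples (the paper does not prescribe this exact pairing):

```python
import numpy as np

def naive_mining_margin_loss(x, W, y, s=30.0, m=0.5, gamma=2.0):
    """Naive fusion: an ArcFace-style margin on the target logit plus
    focal-style re-weighting g(p_y). Only an illustration of the idea."""
    x = x / np.linalg.norm(x)
    W = W / np.linalg.norm(W, axis=1, keepdims=True)
    cos = W @ x
    logits = s * cos
    theta_y = np.arccos(np.clip(cos[y], -1.0, 1.0))
    logits[y] = s * np.cos(theta_y + m)                # s * f(m, theta_{w_y, x})
    logits = logits - logits.max()
    p = np.exp(logits) / np.exp(logits).sum()
    return (1.0 - p[y]) ** gamma * -np.log(p[y])       # g(p_y) * cross-entropy

rng = np.random.default_rng(0)
print(naive_mining_margin_loss(rng.normal(size=128), rng.normal(size=(10, 128)), 3))
```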
[Figure 2 from the paper]

Support Vector Guided Softmax Loss

$$\mathcal{L}_5=-\log\frac{e^{s\cos(\theta_{w_y,x})}}{e^{s\cos(\theta_{w_y,x})}+\sum_{k\neq y}^{K}h(t,\theta_{w_k,x},I_k)e^{s\cos(\theta_{w_k,x})}}$$
where $h(t,\theta_{w_k,x},I_k)=e^{s(t-1)(\cos(\theta_{w_k,x})+1)I_k}$, and $I_k$ is a binary indicator that equals 1 when class $k$ is a support vector for the sample, i.e., when $\cos(\theta_{w_y,x})-\cos(\theta_{w_k,x})<0$, and 0 otherwise.
When $t=1$, the loss above reduces to the plain softmax loss.
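A NumPy sketch of how I read this loss for a single sample; the value $t=1.2$ is only a placeholder for the hyper-parameter.

```python
import numpy as np

def sv_softmax_loss(x, W, y, s=30.0, t=1.2):
    """SV-Softmax for one sample: non-target classes whose cosine exceeds the
    target cosine are support vectors (I_k = 1) and their term in the
    denominator is re-scaled by h = exp(s * (t - 1) * (cos + 1))."""
    x = x / np.linalg.norm(x)
    W = W / np.linalg.norm(W, axis=1, keepdims=True)
    cos = W @ x
    I = (cos > cos[y]).astype(float)                   # indicator I_k of support-vector classes
    I[y] = 0.0
    h = np.exp(s * (t - 1.0) * (cos + 1.0) * I)        # h(t, theta_{w_k, x}, I_k)
    mask = np.arange(len(cos)) != y
    num = np.exp(s * cos[y])
    denom = num + np.sum(h[mask] * np.exp(s * cos[mask]))
    return -np.log(num / denom)

rng = np.random.default_rng(0)
print(sv_softmax_loss(rng.normal(size=128), rng.normal(size=(10, 128)), 3))
```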

SV-X-Softmax

$$\mathcal{L}_6=-\log\frac{e^{sf(m,\theta_{w_y,x})}}{e^{sf(m,\theta_{w_y,x})}+\sum_{k\neq y}^{K}h(t,\theta_{w_k,x},I_k)e^{s\cos(\theta_{w_k,x})}}$$
where $h(t,\theta_{w_k,x},I_k)=e^{s(t-1)(\cos(\theta_{w_k,x})+1)I_k}$ as before, and the indicator is now defined against the margined target logit: $I_k=1$ when $f(m,\theta_{w_y,x})-\cos(\theta_{w_k,x})<0$, and 0 otherwise.
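A corresponding sketch of SV-X-Softmax, assuming an ArcFace-style margin function as the "X" (any of the margin functions above could be substituted):

```python
import numpy as np

def sv_x_softmax_loss(x, W, y, s=30.0, t=1.2, m=0.5):
    """SV-X-Softmax sketch with f(m, theta) = cos(theta + m) as the margin
    function; other margin functions plug in the same way."""
    x = x / np.linalg.norm(x)
    W = W / np.linalg.norm(W, axis=1, keepdims=True)
    cos = W @ x
    f_y = np.cos(np.arccos(np.clip(cos[y], -1.0, 1.0)) + m)   # f(m, theta_{w_y, x})
    I = (cos > f_y).astype(float)    # support vectors defined against the margined target logit
    I[y] = 0.0
    h = np.exp(s * (t - 1.0) * (cos + 1.0) * I)
    mask = np.arange(len(cos)) != y
    num = np.exp(s * f_y)
    denom = num + np.sum(h[mask] * np.exp(s * cos[mask]))
    return -np.log(num / denom)

rng = np.random.default_rng(0)
print(sv_x_softmax_loss(rng.normal(size=128), rng.normal(size=(10, 128)), 3))
```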
[Figure 3 from the paper]
