[Paper Notes] SphereFace: Deep Hypersphere Embedding for Face Recognition

有来有去-CV

Article link: https://blog.csdn.net/shaoxiaohu1/article/details/78885080

Reference: Liu W, Wen Y, Yu Z, et al. SphereFace: Deep Hypersphere Embedding for Face Recognition[J]. arXiv preprint arXiv:1704.08063, 2017.

Summary

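I previously wrote an introduction to large-margin softmax (L-Softmax), which learns more discriminative features than the plain softmax loss. Building on L-Softmax, this paper proposes Angular-Softmax (A-Softmax) for learning discriminative features. It imposes a discriminative constraint on a hypersphere manifold, which intrinsically matches the prior that faces also lie on a manifold. On the face datasets LFW, YTF and MegaFace, A-Softmax outperforms the other loss functions. As with L-Softmax, the angular margin is controlled by a parameter $m$.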

Source code

Algorithm

1. Softmax Loss

Before introducing A-Softmax, let us review the softmax loss. Given the $i$-th input feature $\mathbf{x}_i$ and its label $y_i$, the softmax loss is:

$L= \frac{1}{N} \sum_{i}{L_i}=\frac{1}{N} \sum_{i}{-\log\left(\frac{e^{f_{y_i}}}{\sum_j e^{f_j}}\right)}$
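where $f_j$ is the $j$-th element of the class score vector $\mathbf{f}$ output by the final fully connected layer, and $N$ is the number of training samples. Since $\mathbf{f}$ is the output of the fully connected layer with weights $\mathbf{W}$, $f_{y_i}$ can be written as $f_{y_i}=\mathbf{W}_{y_i}^{T}\mathbf{x}_i+b_{y_i}$, and the per-sample loss can be rewritten as: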

$L_i= -\log\left(\frac{e^{\Vert\mathbf{W}_{y_i}\Vert\Vert\mathbf{x}_i\Vert \cos(\theta_{y_i,i})+b_{y_i}}} {\sum_j{e^{\Vert\mathbf{W}_j\Vert\Vert\mathbf{x}_i\Vert \cos(\theta_{j,i})+b_j}}}\right)$
where $\theta_{j,i}$ ($0\le\theta_{j,i}\le\pi$) is the angle between $\mathbf{W}_j$ and $\mathbf{x}_i$. Setting $\Vert\mathbf{W}_j\Vert=1$ and $b_j=0$ gives a modified softmax loss:

$L_{modified}= -\log\left(\frac{e^{\Vert\mathbf{x}_i\Vert \cos(\theta_{y_i,i})}} {\sum_j{e^{\Vert\mathbf{x}_i\Vert \cos(\theta_{j,i})}}}\right)$
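As a quick numerical check of the formulas above, the following sketch evaluates the per-sample softmax loss in two ways, once from the logits $\mathbf{W}_j^{T}\mathbf{x}_i+b_j$ and once from the norm-and-angle form, and then evaluates the modified loss. It is a minimal sketch in plain Python. The helper names (`softmax_loss`, `angle_between`) and the toy numbers are illustrative assumptions, not taken from the paper or its released code.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def norm(u):
    return math.sqrt(dot(u, u))

def angle_between(u, v):
    """Angle in [0, pi] between two non-zero vectors."""
    c = dot(u, v) / (norm(u) * norm(v))
    return math.acos(max(-1.0, min(1.0, c)))  # clamp against rounding error

def softmax_loss(logits, y):
    """-log(exp(f_y) / sum_j exp(f_j)), computed with the log-sum-exp trick."""
    top = max(logits)
    log_sum = top + math.log(sum(math.exp(f - top) for f in logits))
    return log_sum - logits[y]

# Toy setup (hypothetical numbers): 3 classes, 2-D feature.
W = [[2.0, 0.0], [0.0, 1.0], [-1.0, -1.0]]  # one weight vector W_j per class
b = [0.1, -0.2, 0.0]
x = [1.0, 0.5]
y = 0

# Form 1: logits f_j = W_j^T x + b_j
logits_dot = [dot(Wj, x) + bj for Wj, bj in zip(W, b)]

# Form 2: logits f_j = ||W_j|| ||x|| cos(theta_j) + b_j
logits_cos = [norm(Wj) * norm(x) * math.cos(angle_between(Wj, x)) + bj
              for Wj, bj in zip(W, b)]

loss_dot = softmax_loss(logits_dot, y)
loss_cos = softmax_loss(logits_cos, y)

# Modified softmax: ||W_j|| = 1 and b_j = 0, so f_j = ||x|| cos(theta_j)
logits_mod = [norm(x) * math.cos(angle_between(Wj, x)) for Wj in W]
loss_modified = softmax_loss(logits_mod, y)

print(loss_dot, loss_cos, loss_modified)
```

The first two losses agree up to floating-point error, since the two forms are the same quantity written differently. The third generally differs, because it discards the weight norms and the biases.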

PS: Unlike L-Softmax, the authors not only assume $b_j=0$ but also set $\Vert\mathbf{W}_j\Vert$ to 1.
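To see what fixing $\Vert\mathbf{W}_j\Vert=1$ and $b_j=0$ changes, the sketch below compares the predicted class before and after weight normalization for a hand-picked feature. With unnormalized weights, a class with a large weight norm can win even though its angle to $\mathbf{x}$ is larger. After normalization the prediction is decided by the angle alone. The helper names and numbers are hypothetical, chosen only for illustration.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def norm(u):
    return math.sqrt(dot(u, u))

def normalize(u):
    n = norm(u)
    return [a / n for a in u]

def predict(W, x, b=None):
    """Index of the largest logit W_j^T x + b_j (b_j = 0 when b is None)."""
    b = b if b is not None else [0.0] * len(W)
    logits = [dot(Wj, x) + bj for Wj, bj in zip(W, b)]
    return max(range(len(logits)), key=logits.__getitem__)

def angle_deg(u, v):
    c = dot(u, v) / (norm(u) * norm(v))
    return math.degrees(math.acos(max(-1.0, min(1.0, c))))

# Hypothetical two-class example: the second weight vector has a much larger norm.
W = [[1.0, 0.0], [0.0, 5.0]]
x = [1.0, 0.5]

angles = [angle_deg(Wj, x) for Wj in W]               # angle of x to each W_j
pred_raw = predict(W, x)                              # depends on norms and angles
pred_unit = predict([normalize(Wj) for Wj in W], x)   # depends on angles only

print(angles, pred_raw, pred_unit)
```

Here `x` is closer in angle to the first weight vector, yet the raw logits pick the second class because of its norm. The normalized weights pick the first.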

2. Introducing the angular margin

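For ease of explanation, the authors use binary classification as an example. To classify a feature $\mathbf{x}$ from class 1 correctly, the modified softmax loss requires $\cos(\theta_1)>\cos(\theta_2)$, i.e. $\theta_1<\theta_2$. The paper adds an integer parameter $m$ ($m\ge2$) on top of this: correct classification of class 1 now requires $\cos(m\theta_1)>\cos(\theta_2)$, i.e. $\theta_1<\theta_2/m$, and symmetrically class 2 requires $\cos(m\theta_2)>\cos(\theta_1)$, i.e. $\theta_2<\theta_1/m$. This tightens the decision constraint, so the learned features are more discriminative. The softmax loss modified along these lines is: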

$L_{ang}= -\log\left(\frac{e^{\Vert\mathbf{x}_i\Vert \cos(m\theta_{y_i,i})}} {e^{\Vert\mathbf{x}_i\Vert \cos(m\theta_{y_i,i})}+\sum_{j\neq y_i}{e^{\Vert\mathbf{x}_i\Vert \cos(\theta_{j,i})}}}\right)$
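where $0 \le \theta_{y_i,i}\le \frac{\pi}{m}$. As in the L-Softmax paper, to remove this range restriction so that the loss can be optimized by forward and backward propagation in a CNN, $\cos(m\theta_{y_i,i})$ is replaced by a function $\psi(\theta_{y_i,i})$: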

$L_{ang}= -\log\left(\frac{e^{\Vert\mathbf{x}_i\Vert \psi(\theta_{y_i,i})}} {e^{\Vert\mathbf{x}_i\Vert \psi(\theta_{y_i,i})}+\sum_{j\neq y_i}{e^{\Vert\mathbf{x}_i\Vert \cos(\theta_{j,i})}}}\right)$
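The effect of the margin parameter $m$ can be checked numerically. The sketch below evaluates the $\cos(m\theta_{y_i,i})$ form of $L_{ang}$ directly from angles, restricted to $0\le\theta_{y_i,i}\le\pi/m$, together with the binary decision rule for a class-1 sample. It deliberately does not implement $\psi$. The function names (`angular_margin_loss`, `correctly_classified`) and the sample angles are assumptions for illustration only.

```python
import math

def angular_margin_loss(x_norm, thetas, y, m):
    """L_ang for one sample, given ||x||, the angles theta_j to every W_j
    (in radians), the target index y and the margin m.
    Only valid for 0 <= theta_y <= pi / m."""
    if not 0.0 <= thetas[y] <= math.pi / m:
        raise ValueError("theta_y must lie in [0, pi/m] for the cos(m*theta) form")
    # The target class uses cos(m * theta); all other classes use cos(theta).
    logits = [x_norm * math.cos(m * t if j == y else t)
              for j, t in enumerate(thetas)]
    top = max(logits)
    log_sum = top + math.log(sum(math.exp(f - top) for f in logits))
    return log_sum - logits[y]

def correctly_classified(theta_1, theta_2, m):
    """Binary case, sample from class 1: requires cos(m*theta_1) > cos(theta_2)."""
    return math.cos(m * theta_1) > math.cos(theta_2)

# Hypothetical sample: 30 degrees from its own class, 50 degrees from the other.
theta_1, theta_2 = math.radians(30), math.radians(50)

ok_m1 = correctly_classified(theta_1, theta_2, m=1)  # plain modified softmax
ok_m2 = correctly_classified(theta_1, theta_2, m=2)  # needs theta_1 below theta_2 / 2

loss_m1 = angular_margin_loss(10.0, [theta_1, theta_2], y=0, m=1)
loss_m2 = angular_margin_loss(10.0, [theta_1, theta_2], y=0, m=2)

print(ok_m1, ok_m2, loss_m1, loss_m2)
```

With $m=2$ the same sample violates the stricter condition and incurs a larger loss. This is the pressure that pulls features of the same class closer together in angle during training.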
In this loss, $\psi(\theta)$ can be expressed as: