【Auxiliary FAS】《Learning Deep Models for Face Anti-Spoofing: Binary or Auxiliary Supervision》-CSDN博客

本文链接：https://blog.csdn.net/bryant_meng/article/details/144960888

在这里插入图片描述

CVPR-2018

文章目录

1、Background and Motivation
2、Related Work
3、Advantages / Contributions
4、Method
5、Experiments
6、Conclusion（own） / Future work

1、Background and Motivation

人脸识别应用越来越广泛， face anti-spoofing is vital to ensure that face recognition systems are robust to PA and safe to use.

传统的人脸活体检测基于 texture-based + machine learning

基于深度学习的活检通常 formulate as a binary classification problem，泛化性能比较弱，不足以应付 the different
levels of image degradation, namely spoof patterns

假也很多种形式，真假界限模糊，决策过程不能简单的非黑即白——without explanation or rationale for the decision

在这里插入图片描述

本文作者采用的是 auxiliary supervision 而不是 binary supervision，结合深度图和 rPPG 信号，来辅助决策活检最终的结果

2、Related Work

Texture-based Methods
Temporal-based Methods
Haralick features, motion mag, and optical flow
Remote Photoplethysmography (rPPG)
Remote photoplethysmography (rPPG) is the technique to track vital signals, such as heart rate，without any contact with human skin

基于rPPG的人脸活体检测综述
在这里插入图片描述

3、Advantages / Contributions

用深度图和 rPPG 同时辅助监督来做人脸活体检测，采用 CNN + RNN 的结构，兼顾空间和时序信息
（ a novel CNN-RNN architecture for end-to-end learning the depth map and rPPG signal.）
公开 Spoof in the Wild Database (SiW) 数据集

4、Method

在这里插入图片描述

CNN + RNN，depth + rPPG

更细节的图示

在这里插入图片描述

（1）Depth Map Supervision

pixel-wise supervision

3D mask 的标签来自

Jourabloo A, Liu X. Pose-invariant face alignment via CNN-based dense 3D model fitting[J]. International Journal of Computer Vision, 2017, 124: 187-203.

Liu Y, Jourabloo A, Ren W, et al. Dense face alignment[C]//Proceedings of the IEEE International Conference on Computer Vision Workshops. 2017: 1619-1628.

在这里插入图片描述

真人脸有标签，假人脸标签 depth 全部置为 0

RNN 的输入之一 3D shape $S$ 也是该方法计算得到，深度图监督信息根据 $S$ 计算得出来的

在这里插入图片描述

（2）rPPG Supervision

sequence-wise supervision，rPPG 信号监督

rPPG 信号的监督信息计算方法来自，输入 video 提取即可

De Haan G, Jeanne V. Robust pulse rate from chrominance-based rPPG[J]. IEEE transactions on biomedical engineering, 2013, 60(10): 2878-2886.

上述方法提取出来的 rPPG 信号有如下缺点

sensitive to pose and expression variation、illumination change、
not be sufficiently distinguishable to signals of live videos

作者用 RNN 网络来学 rPPG 信号，现在标签都不太准确怎么办？

作者采用的是 pseudo-rPPG signal

作者假设同一个人各种 poses, illuminations, expressions (PIE) 情况下的 rPPG和正常情况下一致

the same subject under different PIE conditions have the same ground truth rPPG signal.

理由：输入 video 的那短暂时间内，心跳频率相似，不太受环境和姿态的影响

since the heart beat is similar for the videos of the same subject that are captured in a short span of time (< 5 minutes).

由此一来，我们可以用正常 PIE 下人脸的 rPPG 信号作为标签，来监督各类 PIE 情况，提升网络对 PIE 的鲁棒性同时，通过
《Robust pulse rate from chrominance-based rPPG》方法计算得到的标签也更真实可信（最理想情况下计算得到的最准）

大白话说，不论什么 PIE，我用正常 PIE 下计算得到的 rPPG 信号作为监督信号用于 RNN 网络训练

在这里插入图片描述
真人脸有 rPPG 信号，假人脸 rPPG 信号为 0

（3）Network Architecture

在这里插入图片描述

CNN 仅用深度信息监督

RNN 用 rPPG 信号监督，RNN 的输入有深度图，特征图和

first stream only updates the weights of the CNN part, the back propagation of the second stream updates the weights of both CNN and RNN parts in an end-to-end manner.

CNN 优化函数

在这里插入图片描述