Face Detection Literature Reading Notes

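This post summarizes deep learning methods for face detection, including the advantages of jointly trained cascaded CNNs and the multi-stage detection idea also seen in Faster R-CNN. It also covers the Scale-Aware Face Detection strategy, which estimates face scales with a Scale Proposal Network (SPN) and then runs an efficient Single-Scale RPN for detection. In the future this approach may be extended to general object detection, with further optimization of model size and speed.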

A quick search also turned up: Face Detection and Recognition Literature Reading Summary

Detection papers usually contain a lot of detail; only the main ideas are summarized here.

Joint Training of Cascaded CNN for Face Detection

Advantages of the cascade:

handle unbalanced distribution of negative and positive samples. In the early stages, weak classifiers can reject most false positives. In the later stages, stronger classifiers can save computation with less proposals. Faster R-CNN is one example of this idea.

[Figure: the three-stage network structure from the paper; image not preserved]

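The three-stage structure here resembles Cascaded CNN, except that localization uses bbox regression instead of a bounding-box calibration network. See Section 4.3, Testing pipeline: the input to each stage is cropped from the original image using the boxes that passed the threshold in the previous stage. This amounts to first checking at a coarse granularity whether a face is present and roughly where it is, then cropping out the ROI and doing finer detection and localization on it (note the difference from an image pyramid; the design goals are not quite the same). The second stage performs hard negative sample mining and the third performs harder negative mining, so progressively more difficult samples are handled.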
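The testing pipeline described above can be sketched roughly as follows. This is a minimal illustration, not the paper's implementation: `crop`, `run_cascade`, and the `(scorer, threshold)` stage format are hypothetical names, the image is a plain nested list, and bbox regression between stages is left out.

```python
from typing import Callable, List, Sequence, Tuple

Box = Tuple[int, int, int, int]  # (x1, y1, x2, y2) in original-image pixels
Image = List[List[float]]        # grayscale image as rows of pixel values


def crop(image: Image, box: Box) -> Image:
    """Cut the box region out of the original image."""
    x1, y1, x2, y2 = box
    return [row[x1:x2] for row in image[y1:y2]]


def run_cascade(
    image: Image,
    proposals: Sequence[Box],
    stages: Sequence[Tuple[Callable[[Image], float], float]],
) -> List[Box]:
    """Pass proposals through (scorer, threshold) stages in order.

    Every stage scores a crop taken from the ORIGINAL image, and only
    boxes whose score reaches that stage's threshold move on.
    (Assumption: a real stage would also refine the box; omitted here.)
    """
    survivors = list(proposals)
    for scorer, threshold in stages:
        survivors = [b for b in survivors if scorer(crop(image, b)) >= threshold]
    return survivors
```

Because each stage only sees the survivors of the previous one, the later and more expensive scorers run on far fewer crops.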

Also note how the training procedure is designed:

The principle is to make the threshold as high as possible while keeping the recall, so as to reject as many proposals as possible in the earlier stages.
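One way to read this principle in code: given the scores a stage assigns to known faces on a validation set, pick the highest threshold that still keeps a target recall. The helper `pick_threshold` and its `target_recall` parameter are assumptions made for illustration; the notes do not say how the paper actually selects its thresholds.

```python
import math
from typing import Sequence


def pick_threshold(face_scores: Sequence[float], target_recall: float) -> float:
    """Highest threshold t such that at least `target_recall` of the
    true-face scores satisfy score >= t."""
    if not face_scores:
        raise ValueError("need at least one face score")
    if not 0.0 < target_recall <= 1.0:
        raise ValueError("target_recall must be in (0, 1]")
    ranked = sorted(face_scores, reverse=True)
    # Number of faces that must survive; rounding up errs on the side of recall.
    must_keep = math.ceil(target_recall * len(ranked))
    return ranked[must_keep - 1]
```

For example, with face scores `[0.9, 0.7, 0.5, 0.1]` and a target recall of 0.75, three of the four faces must survive, so the threshold is 0.5.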

The paper handles face scale variation with an image pyramid; each pyramid level is fed in as the input in the figure above.
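As a rough illustration of what an image pyramid costs, the sketch below lists the sizes of successive levels. The function `pyramid_sizes`, the downscale `factor`, and the `min_side` stopping rule are hypothetical choices; the paper's actual pyramid settings are not recorded in these notes.

```python
from typing import List, Tuple


def pyramid_sizes(
    width: int, height: int, min_side: int, factor: float = 0.5
) -> List[Tuple[int, int]]:
    """(width, height) of each pyramid level, starting at full size.

    Each level is `factor` times the previous one; generation stops
    before the shorter side drops below `min_side`.
    """
    if not 0.0 < factor < 1.0:
        raise ValueError("factor must be in (0, 1)")
    sizes = []
    scale = 1.0
    while min(width, height) * scale >= min_side:
        sizes.append((round(width * scale), round(height * scale)))
        scale *= factor
    return sizes
```

Every level returned here is a separate forward pass through the whole cascade, which is the cost multi-scale testing pays.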

Scale-Aware Face Detection
How CNNs handle multi-scale face detection:

either fitting a large single model to faces across a large scale range or multi-scale testing (such as the image pyramid in the paper above).

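Both options add considerable computation. The idea of this paper is to first use a CNN to estimate the distribution of face scales in the image, and then zoom the image in or out accordingly.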

In this way, the face detection procedure can be divided into face scale estimation and single scale detection.

As shown in the figure:
[Figure: overview of the two-stage pipeline; image not preserved]

Stage one, the Scale Proposal Network (SPN):
[Figure: the SPN; image not preserved]

Note how the ground-truth histogram for the SPN is generated.
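These notes do not spell out the construction, so the following is only a guess at its general shape: bins spaced evenly in log2 of face size, with each ground-truth face spread over neighboring bins by a Gaussian. The function name, bin range, `bins_per_octave`, and `sigma` are all assumptions, not values from the paper.

```python
import math
from typing import List, Sequence


def scale_histogram(
    face_sizes: Sequence[float],
    min_log2: float = 3.0,
    max_log2: float = 9.0,
    bins_per_octave: int = 2,
    sigma: float = 0.5,
) -> List[float]:
    """Soft target histogram over log2(face size in pixels).

    Each face contributes a Gaussian bump centered on its own scale;
    bumps are merged with max so a bin never exceeds 1.
    """
    n_bins = int((max_log2 - min_log2) * bins_per_octave) + 1
    centers = [min_log2 + i / bins_per_octave for i in range(n_bins)]
    hist = [0.0] * n_bins
    for size in face_sizes:
        s = math.log2(size)
        for i, c in enumerate(centers):
            bump = math.exp(-((c - s) ** 2) / (2 * sigma ** 2))
            hist[i] = max(hist[i], bump)
    return hist
```

The soft target means a prediction that lands one bin away from the true scale is penalized less than one that is several octaves off.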

Stage two, the detector (RPN):

Since the face size variation is already handled in the first stage, in this stage, we only use an RPN with one anchor. The largest detectable face size is set to be twice the size of the smallest detectable face. This configuration is enough to achieve high accuracy while keeping average zooms per image low and the RPN computationally cheap. The RPN we use is called Single-Scale RPN, since it has only one anchor and has a narrow face size coverage.
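To make the "twice the smallest detectable face" range concrete, here is a small sketch of how estimated face sizes could be turned into a short list of zoom factors. The greedy grouping and the names `zoom_factors` and `min_detectable` are illustrative assumptions; the paper derives its zooms from the SPN output, and that procedure is not reproduced here.

```python
from typing import List, Sequence


def zoom_factors(face_sizes: Sequence[float], min_detectable: float) -> List[float]:
    """One zoom factor per group of faces that fit within a 2x size range.

    Sizes are sorted; a new group starts whenever a face is more than
    twice the size of the current group's smallest face. Each zoom maps
    its group's smallest face onto `min_detectable`, so the whole group
    lands inside [min_detectable, 2 * min_detectable].
    """
    zooms: List[float] = []
    group_start = None
    for size in sorted(face_sizes):
        if group_start is None or size > 2 * group_start:
            group_start = size
            zooms.append(min_detectable / group_start)
    return zooms
```

With estimated sizes of 20, 30, 40, 100, and 200 pixels and a detector whose smallest detectable face is 40 pixels, two zooms (2.0 and 0.4) cover all five faces, which is the sense in which the average number of zooms per image stays low.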

Directions for future extension

The proposed method can also be applied to general object detection problems. Moreover, the SPN is essentially a weakly-supervised detector, which could be used to generate coarse region proposals and further improves speed. SPN can also share convolution layers with RPN to further reduce model size.

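One question: can a cascade still be used here to improve efficiency? Other detection speed-up methods encountered before: reducing the number of proposals, YOLO and similar detectors, R-FCN, and Light-Head R-CNN.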

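This round also touched on quite a few scale issues, such as the image pyramid and the SPN. These are worth thinking through again alongside the FPN paper:
[Figure from the FPN paper; image not preserved]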
