https://www.arxiv.org/abs/1608.08021
Demo code: https://github.com/sanghoon/pva-faster-rcnn
This paper addresses multi-category object detection and combines a range of recent techniques to achieve strong results.
We obtained solid results on well-known object detection benchmarks: 81.8% mAP (mean average precision) on VOC2007 and 82.5% mAP on VOC2012 (2nd place), while taking only 750ms/image on Intel i7-6700K CPU with a single core and 46ms/image on NVIDIA Titan X GPU. Theoretically, our network requires only 12.3% of the computational cost compared to ResNet-101, the winner on VOC2012.
Overall detection pipeline: CNN feature extraction + region proposal + RoI classification
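We mainly optimize the feature extraction part, because the region proposal part is already fast and takes very little time, and the classification part can be compressed effectively with SVD to reduce model complexity. Our design principle is fewer kinds of features and more layers: fewer channels with more layers. The network design uses concatenated ReLU, Inception, and HyperNet.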
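Two of the ideas above are easy to illustrate in a few lines of NumPy. The sketch below is not the authors' code. It shows (1) the basic form of concatenated ReLU, which stacks ReLU(x) and ReLU(-x) along the channel axis so that a layer can compute half as many filters, and (2) compressing a fully connected weight matrix with truncated SVD. The function names `crelu` and `compress_fc`, the tensor shapes, and the rank `k` are all assumptions chosen for illustration, and the learned scale and shift that the paper's C.ReLU variant adds are omitted.

```python
import numpy as np


def crelu(x, axis=1):
    """Basic concatenated ReLU: join ReLU(x) and ReLU(-x) along `axis`.

    The output has twice as many channels as the input.
    """
    return np.concatenate([np.maximum(x, 0), np.maximum(-x, 0)], axis=axis)


def compress_fc(W, k):
    """Approximate W (out_dim x in_dim) by two rank-k factors A @ B.

    A has shape (out_dim, k) and B has shape (k, in_dim), so one large
    fully connected layer becomes two smaller ones.
    """
    U, s, Vt = np.linalg.svd(W, full_matrices=False)
    A = U[:, :k] * s[:k]  # fold the singular values into the left factor
    B = Vt[:k, :]
    return A, B


rng = np.random.default_rng(0)

# (1) C.ReLU on a hypothetical feature map laid out as N x C x H x W.
x = rng.standard_normal((2, 4, 8, 8))
y = crelu(x)  # channels double from 4 to 8

# (2) Truncated SVD on a hypothetical FC weight matrix of exact rank 8.
W = rng.standard_normal((64, 8)) @ rng.standard_normal((8, 128))
A, B = compress_fc(W, k=8)
params_before = W.size
params_after = A.size + B.size
```

In this toy setup `W` is built to have rank 8, so the factorization reproduces it up to floating-point error. Real FC weights are only approximately low rank, so `k` trades accuracy against speed and size.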