R-CNN, Fast R-CNN, Faster R-CNN
今年四月份的时候,在一个研究院实习时学习了RCNN, Fast RCNN, Faster RCNN系列Object Detection框架,现在总结一下。
一. R-CNN(Regions with CNN features)
1.1 框架结构
论文中提到:
Our object detection system consists of three modules.
The first generates category-independent region proposals. These proposals define the set of candidate detections available to our detector.
The second module is a large convolutional neural network that extracts a fixed-length feature vector from each region.
The third module is a set of class specific linear SVMs.
Bounding-box Regression
Based on the error analysis, we implemented a simple method to reduce localization errors. Inspired by the bounding-box regression employed in DPM, we train a linear regression model to predict a new detection window given the pool5features for a selective search region proposal.
我们便知道R-CNN由三个部分组成:
1. 提取Region Proposals的模块