论文笔记之网络结构篇:RCNN

最新推荐文章于 2023-02-17 18:57:33 发布

eight_Jessen

最新推荐文章于 2023-02-17 18:57:33 发布

阅读量222

点赞数

分类专栏：论文笔记文章标签：深度学习 pytorch 机器学习神经网络

本文链接：https://blog.csdn.net/eight_Jessen/article/details/107945424

版权

论文笔记专栏收录该内容

49 篇文章 7 订阅

订阅专栏

PAST: Combine multiplle low-level image features with high-level context

Key insights:

CNN ---- bottom-up region proposals in order to localize and segment objects
labeled training data is scare ---- supervised pre-training for anauxiliary task, followed by domain-speciﬁc ﬁne-tuning, yields a signiﬁcant performance boost.

The conventional solution to this problem is to use unsupervised pre-training,followed by supervised ﬁne-tuning .
supervised pre-training on a large auxiliary dataset(ILSVRC),followed by domain speciﬁc ﬁne-tuning on a small dataset (PASCAL), is an effective paradigm for learning high-capacity CNNs when data is scarce.

Object detection with R-CNN

Three module:

generates category-independent region proposals
a large convolutional neural network that extracts a ﬁxed-length feature vector from each region
a set of class speciﬁc linear SVMs.

2.1 Module design

Region proposals:

selective search

Feature extraction

2.2 Test-time detection

proposal 2000 region proposals ——> for each class, score each extracted feature vector --> non-maximum suppression

Run-time analysis

efficient

all CNN parameters are shared across all categories

the feature vectors computed by the CNN are low-dimensional

2.3 Training

Supervised pre-training

pre-trained the CNN on a large auxiliary dataset (ILSVRC2012 classiﬁcation) using image-level annotations only (boundingbox labels are not available for this data).

Domain-specific fine-tuning

replacing the CNN’s ImageNetspeciﬁc 1000-way classiﬁcation layer with a randomly initialized (N + 1)-way classiﬁcation layer (where N is the number of object classes, plus 1 for background)