Object Detection《faster-rcnn》笔记(4)

Faster R-CNN: Towards real-time object detection with region proposal networks

说明: introduce a Region Proposal Network (RPN) that shares full-image convolutional features with the detection network, thus enabling nearly cost-free region proposals。An RPN is a fully-convolutional network that simultaneously predicts object bounds and objectness scores at each position.

Introduction

1,region proposals are the computational bottleneck in state-of-the-art detection systems。

Region Proposal Networks

1,A Region Proposal Network (RPN) takes an image (of any size) as input and outputs a set of rectangular object proposals, each with an objectness score
2,This network is fully connected to an n × n spatial window of the input conv feature map. Each sliding window is mapped to a lower-dimensional vector (256-d for ZF and 512-d for VGG). This vector is fed into two sibling fully-connected layers—a box-regression layer (reg) and a box-classification layer (cls)。
3,predict k region proposals, so the reg layer,has 4k outputs encoding the coordinates of k boxes
这里写图片描述

A Loss Function for Learning Region Proposals

assign a positive label to two kinds of anchors: (i) the anchor/anchors with the highest Intersectionover-Union (IoU) overlap with a ground-truth box, or (ii) an anchor that has an IoU overlap higher than 0.7 with any ground-truth box.
这里写图片描述

Optimization

1,The RPN, which is naturally implemented as a fully-convolutional network, can be trained end-to-end by back-propagation and stochastic gradient descent (SGD)
2,we randomly sample 256 anchors in an image to compute the loss function of a mini-batch, where the sampled positive and negative anchors have a ratio of up to 1:1
3,all new layers的权值初始化:高斯分布(μ=0,σ=0.01),all other layers(比如共享卷积层)用ImageNet来权值初始化。用ZF net来进行进行微调。学习率:0.001(60k)->0.0001(20k)。动量:0.9。weight decay: 0.0005。

Sharing Convolutional Features for Region Proposal and Object Detection

1,sharing convolutional layers between the two networks, rather than learning two separate networks
2, 4-step training algorithm to learn shared features via alternating optimization

Implementation Details

1,Multi-scale与speed-accuracy之间的trade-off
2,To reduce redundancy, we adopt non-maximum suppression (NMS) on the proposal regions based on their cls scores.

Faster R-CNN

这里写图片描述

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值