SSD (Single Shot MultiBox Detector): Theory Study Notes

一、Performance of SSD and other mainstream algorithms.

Method (VOC2007 test)    FPS    mAP
SSD                      59     72.1%
Faster R-CNN             7      73.2%
YOLO                     45     63.4%

According to the SSD paper, the fundamental speed improvement of SSD comes from eliminating bounding box proposals and the subsequent pixel or feature resampling stage. SSD is not the first work to do this, but it adds a series of improvements.
These improvements include:
1、using small convolutional filters to predict object categories and offsets in bounding box locations;
2、applying these filters to multiple feature maps from the later stages of a network in order to perform detection at multiple scales.

二、Contributions of SSD

1、It is faster and more accurate than the previous state-of-the-art single-shot detector (YOLO), and it is as accurate as slower techniques that perform explicit region proposals and pooling (including Faster R-CNN).
2、The core of SSD is predicting category scores and bounding box offsets for a fixed set of prior bounding boxes (with different aspect ratios), using small convolutional filters applied to multiple feature maps. Predicting from multiple feature maps improves accuracy.
3、It is trained end to end.

三、SSD convolutional predictor

[Figure: image not included]
As shown in the figure, each selected feature map feeds three branches:
1、prior bounding box offset prediction
2、prior bounding box confidence prediction
3、prior bounding box generation
For example, suppose the feature map (here the one from pool6) has size m×n×c. The basic element for predicting the parameters of a potential detection is a small 3×3×c kernel that produces either an offset prediction or a confidence prediction. At each of the m×n locations, pool6_mbox_priorbox produces k prior bounding boxes. The number of prior bounding boxes is therefore m×n×k, the number of outputs of pool6_mbox_loc (a convolution layer) is 4k, and the number of outputs of pool6_mbox_conf (a convolution layer) is classes×k.
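The counting above can be checked with a few lines of Python. This is only a sketch of the arithmetic: the function name head_output_sizes and the example sizes (a 3×3 feature map, k = 6, 21 classes) are assumptions chosen for illustration, not values taken from the figure.

```python
def head_output_sizes(m, n, k, num_classes):
    """Count priors and prediction channels for one m x n feature map.

    m, n        : spatial size of the feature map
    k           : prior bounding boxes per location
    num_classes : number of categories scored per prior
    """
    num_priors = m * n * k             # boxes produced by the prior box branch
    loc_channels = 4 * k               # 4 offsets per prior at each location
    conf_channels = num_classes * k    # one score per class per prior
    # Total predicted values over the whole feature map.
    total_outputs = m * n * (loc_channels + conf_channels)
    return num_priors, loc_channels, conf_channels, total_outputs


# Hypothetical example sizes, for illustration only.
print(head_output_sizes(m=3, n=3, k=6, num_classes=21))
# (54, 24, 126, 1350)
```

The total equals (classes + 4)×k×m×n, so the cost of the prediction head grows linearly with the number of priors per location.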

四、Training

1、Matching strategy
At training time we need to establish the correspondence between the ground truth boxes and the default boxes. First, each ground truth box is matched to the default box with the best Jaccard overlap. This is the matching approach used by the original MultiBox, and it ensures that each ground truth box has exactly one matched default box. Unlike MultiBox, SSD then also matches default boxes to any ground truth whose Jaccard overlap with them is higher than a threshold (0.5). Adding these matches simplifies the learning problem: it allows the network to predict high confidence for multiple overlapping default boxes, rather than requiring it to pick only the one with maximum overlap.
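A minimal pure-Python sketch of this two-step matching is given below. It is an illustration under stated assumptions, not the reference implementation: boxes are assumed to be (xmin, ymin, xmax, ymax) tuples, the function names jaccard and match_default_boxes are hypothetical, and when two ground truth boxes share the same best default box the later one simply wins.

```python
def jaccard(a, b):
    """Jaccard overlap (IoU) of two boxes given as (xmin, ymin, xmax, ymax)."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def match_default_boxes(gt_boxes, default_boxes, threshold=0.5):
    """For each default box, return the index of its matched ground truth or -1."""
    matches = [-1] * len(default_boxes)
    best = [0.0] * len(default_boxes)
    # Threshold matching: a default box takes the ground truth it overlaps
    # most, provided the overlap exceeds the threshold.
    for d, dbox in enumerate(default_boxes):
        for g, gbox in enumerate(gt_boxes):
            iou = jaccard(gbox, dbox)
            if iou > threshold and iou > best[d]:
                best[d], matches[d] = iou, g
    # Best-overlap matching: every ground truth claims its best default box,
    # even when that overlap is below the threshold. Applied last so it
    # overrides any threshold match on the same default box.
    for g, gbox in enumerate(gt_boxes):
        overlaps = [jaccard(gbox, dbox) for dbox in default_boxes]
        if overlaps and max(overlaps) > 0:
            matches[overlaps.index(max(overlaps))] = g
    return matches


# Hypothetical boxes, for illustration only.
gt = [(0.0, 0.0, 1.0, 1.0), (2.0, 2.0, 3.0, 3.0)]
defaults = [(0.0, 0.0, 1.0, 1.0), (0.0, 0.0, 1.0, 0.8),
            (0.5, 0.5, 1.5, 1.5), (2.0, 2.0, 4.0, 4.0)]
print(match_default_boxes(gt, defaults))  # [0, 0, -1, 1]
```

In this example the first two default boxes both match ground truth 0 (overlaps 1.0 and 0.8), which is the "multiple overlapping default boxes" case, while the last default box matches ground truth 1 with an overlap of only 0.25 because it is that ground truth's best candidate.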
2、Training objective

$$L(x, c, l, g) = \frac{1}{N}\left(L_{conf}(x, c) + \alpha L_{loc}(x, l, g)\right)$$
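As a rough sketch of how the two terms of the objective above combine, the snippet below takes already-summed confidence and localization losses as plain numbers. It assumes, following the SSD paper, that N is the number of matched default boxes and that the loss is set to 0 when N = 0. The function name ssd_total_loss and the example values are hypothetical.

```python
def ssd_total_loss(conf_loss_sum, loc_loss_sum, num_matched, alpha=1.0):
    """Combine summed losses as L = (L_conf + alpha * L_loc) / N.

    conf_loss_sum : confidence loss summed over the selected default boxes
    loc_loss_sum  : localization loss summed over the matched default boxes
    num_matched   : N, assumed to be the number of matched default boxes
    alpha         : weight of the localization term
    """
    if num_matched == 0:
        # Assumption taken from the SSD paper: no matches means zero loss.
        return 0.0
    return (conf_loss_sum + alpha * loc_loss_sum) / num_matched


# Hypothetical loss values, for illustration only.
print(ssd_total_loss(6.0, 2.0, num_matched=4))             # 2.0
print(ssd_total_loss(6.0, 2.0, num_matched=4, alpha=0.5))  # 1.75
```

Dividing by N keeps the loss scale comparable between images with few and many matched boxes.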

To be continued.
