Detection Review, Part 1: YOLOv2

YOLOv2 paper notes

Contributions

  1. Using a novel, multi-scale training method, the same YOLOv2 model can run at varying sizes, offering an easy tradeoff between speed and accuracy.
  2. We propose a method to jointly train on object detection and classification. The resulting model, YOLO9000, predicts detections for more than 9000 different object categories and still runs in real time.

Better: improvements

  1. By adding batch normalization on all of the convolutional layers in YOLO, we get more than 2% improvement in mAP.

  2. We first fine-tune the classification network at the full 448×448 resolution for 10 epochs on ImageNet, then fine-tune the resulting network on detection. This gives an increase of almost 4% mAP.

  3. We remove the fully connected layers from YOLO and use anchor boxes to predict bounding boxes.

    • We eliminate one pooling layer to make the output of the network's convolutional layers higher resolution.
    • We also shrink the network to operate on 416×416 input images instead of 448×448. We do this because we want an odd number of locations in our feature map so there is a single center cell. The network downsamples by a factor of 32, so a 416×416 input gives a 13×13 feature map.
    • We also decouple the class prediction mechanism from the spatial location and instead predict class and objectness for every anchor box.
    • With anchor boxes our model gets 69.2 mAP with a recall of 88% (versus 69.5 mAP with a recall of 81% without them): a small drop in mAP in exchange for a large gain in recall.
  4. We run k-means clustering on the training set bounding boxes to automatically find good priors. For the distance metric we use d(box, centroid) = 1 − IOU(box, centroid), which, unlike Euclidean distance, does not penalize large boxes more than small ones.

  5. We predict location coordinates relative to the location of the grid cell. This bounds the ground truth to fall between 0 and 1. We use a logistic activation to constrain the network's predictions to fall in this range. The network predicts 5 coordinates for each bounding box: $t_x, t_y, t_w, t_h$, and $t_o$. If the cell is offset from the top left corner of the image by $(c_x, c_y)$ and the bounding box prior has width and height $p_w, p_h$, then the predictions correspond to:

    $$
    \begin{aligned}
    b_x &= \sigma(t_x) + c_x \\
    b_y &= \sigma(t_y) + c_y \\
    b_w &= p_w e^{t_w} \\
    b_h &= p_h e^{t_h} \\
    \Pr(\text{object}) \cdot \mathrm{IOU}(b, \text{object}) &= \sigma(t_o)
    \end{aligned}
    $$

    Using dimension clusters along with directly predicting the bounding box center location improves YOLO by almost 5% over the version with anchor boxes.

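  6. To help localize smaller objects, we take a different approach from running the detector on several feature maps: we simply add a passthrough layer that brings features from an earlier layer at 26 × 26 resolution into the 13 × 13 detection feature map. This gives a modest 1% performance increase.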

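  7. Instead of fixing the input image size, we change the network every few iterations. Every 10 batches our network randomly chooses a new image dimension size. Since the model downsamples by a factor of 32, the sizes are drawn from multiples of 32, from 320 to 608.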
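The anchor-box steps in items 4 and 5 can be sketched together in plain Python: first cluster (width, height) pairs with the 1 − IOU distance to obtain priors, then decode one raw prediction against a prior. This is a minimal sketch, not the paper's implementation. The function names (`iou_wh`, `kmeans_priors`, `decode_box`), the random initialization, and the mean-based centroid update are assumptions made for illustration; the notes above only specify the distance metric and the decoding equations.

```python
import math
import random


def iou_wh(box, centroid):
    """IOU of two (w, h) boxes placed at a common corner, so only shape matters."""
    inter = min(box[0], centroid[0]) * min(box[1], centroid[1])
    union = box[0] * box[1] + centroid[0] * centroid[1] - inter
    return inter / union


def kmeans_priors(boxes, k, iters=50, seed=0):
    """k-means over (w, h) tuples using d = 1 - IOU as the distance.

    Assumption: centroids start from k random boxes and are updated to the
    mean width and height of their cluster.
    """
    rng = random.Random(seed)
    centroids = rng.sample(boxes, k)
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for b in boxes:
            # Assign each box to the centroid with the smallest 1 - IOU.
            j = min(range(k), key=lambda i: 1.0 - iou_wh(b, centroids[i]))
            clusters[j].append(b)
        new = []
        for members, old in zip(clusters, centroids):
            if members:
                mean_w = sum(w for w, _ in members) / len(members)
                mean_h = sum(h for _, h in members) / len(members)
                new.append((mean_w, mean_h))
            else:
                new.append(old)  # keep an empty cluster's centroid unchanged
        if new == centroids:
            break
        centroids = new
    return centroids


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def decode_box(t, cell, prior):
    """Map raw outputs (t_x, t_y, t_w, t_h, t_o) to a box in grid-cell units."""
    tx, ty, tw, th, to = t
    cx, cy = cell
    pw, ph = prior
    bx = sigmoid(tx) + cx      # center stays inside the cell
    by = sigmoid(ty) + cy
    bw = pw * math.exp(tw)     # prior scaled by an exponential factor
    bh = ph * math.exp(th)
    return bx, by, bw, bh, sigmoid(to)
```

For example, `decode_box((0, 0, 0, 0, 0), (6, 6), (3.0, 4.0))` places the box center in the middle of cell (6, 6) and returns the prior's width and height unchanged, because an all-zero prediction means "use the prior as is".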

Faster: improvements

  1. We propose a new classification model to be used as the base of YOLOv2, called Darknet-19. It has 19 convolutional layers and 5 maxpooling layers.
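The layer counts above can be checked with a small sketch that lists the Darknet-19 layers and counts them. The filter widths and kernel sizes below are my transcription of the paper's architecture table and should be verified against it. The helper `summarize` is hypothetical, and it assumes every maxpool halves the resolution while the convolutions preserve it.

```python
# Each row is ("conv", filters, kernel_size) or ("maxpool",).
# Assumption: widths transcribed from the Darknet-19 table in the paper.
DARKNET19 = [
    ("conv", 32, 3), ("maxpool",),
    ("conv", 64, 3), ("maxpool",),
    ("conv", 128, 3), ("conv", 64, 1), ("conv", 128, 3), ("maxpool",),
    ("conv", 256, 3), ("conv", 128, 1), ("conv", 256, 3), ("maxpool",),
    ("conv", 512, 3), ("conv", 256, 1), ("conv", 512, 3),
    ("conv", 256, 1), ("conv", 512, 3), ("maxpool",),
    ("conv", 1024, 3), ("conv", 512, 1), ("conv", 1024, 3),
    ("conv", 512, 1), ("conv", 1024, 3),
    ("conv", 1000, 1),  # 1x1 classifier conv, followed by global average pooling
]


def summarize(layers, input_size):
    """Return (number of convs, number of maxpools, output feature map side)."""
    convs = sum(1 for layer in layers if layer[0] == "conv")
    pools = sum(1 for layer in layers if layer[0] == "maxpool")
    # Assumption: each maxpool is 2x2 with stride 2, convs keep the spatial size.
    return convs, pools, input_size // (2 ** pools)


print(summarize(DARKNET19, 416))  # (19, 5, 13)
print(summarize(DARKNET19, 448))  # (19, 5, 14)
```

Five stride-2 poolings give a total downsampling factor of 32, which is why a 416 input yields an odd 13 × 13 grid while 448 yields an even 14 × 14 one.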

Stronger: improvements

  1. We propose a mechanism for jointly training on classification and detection data. During training we mix images from both detection and classification datasets. When our network sees an image labelled for detection, we can backpropagate based on the full YOLOv2 loss function. When it sees a classification image, we only backpropagate loss from the classification-specific parts of the architecture.
  2. Hierarchical classification.
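The loss routing described in item 1 can be sketched as a single function that decides which loss terms contribute to the gradient for a given image. This is only an illustration of the idea. The function name `backprop_loss`, the label strings, and the split of the loss into `coord`, `objectness`, and `class` terms are hypothetical and are not taken from the paper's code.

```python
def backprop_loss(label_kind, parts):
    """Pick the loss to backpropagate for one image in a mixed batch.

    label_kind: "detection" or "classification" (hypothetical tags).
    parts: dict of already-computed loss terms, e.g.
           {"coord": ..., "objectness": ..., "class": ...}.
    """
    if label_kind == "detection":
        # Detection image: use the full loss (all terms).
        return sum(parts.values())
    if label_kind == "classification":
        # Classification image: only the class term contributes.
        return parts["class"]
    raise ValueError(f"unknown label kind: {label_kind!r}")


parts = {"coord": 0.5, "objectness": 0.25, "class": 1.0}
print(backprop_loss("detection", parts))       # 1.75
print(backprop_loss("classification", parts))  # 1.0
```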