deep learning object detection 精选

deep learning object detection

Paper list from 2014 to 2019

deep_learning_object_detection_history

Milestones

milestone

milestone

Object detector composed parts

  • Input: Image, Patches, Image Pyramid
  • Backbones: VGG16, ResNet-50, SpineNet, EfficientNet-B0/B7 , CSPResNeXt50, CSPDarknet53
  • Neck:
    • Additional blocks: SPP, ASPP, RFB, SAM
    • Path-aggregation blocks: FPN, PAN, NAS-FPN, Fully-connected FPN, BiFPN, ASFF, SFAM
  • Heads:
    • Dense Prediction (one-stage):
      • RPN, SSD, YOLO, RetinaNet (anchor based)
      • CornerNet, CenterNet, MatrixNet, FCOS (anchor free)
    • Sparse Prediction (two-stage):
      • Faster R-CNN, R-FCN, Mask R-CNN (anchor based)
      • RepPoints (anchor free)

Detection methods category

Object detection steps

One-Stage
  1. Extracts feature on all area of image, classify the objects,

    localize bounding-box

Two-Stage
  1. Generates category-independent region proposals,

    extracts feature vector from each region proposal

  2. Classify the objects, precisely bounding-box prediction (NMS)

Small object detection tricks

  • Framework for small object detection

    • Multi-scale Feature Learning

  • Enhance the Receptive Fields (visual attention mechanisms)

  • Data Augmentation

    • GAN-based Detection
    • Flipping, cropping, rotating, scaling
  • Training Strategy

    • Unsupervised object detection
    • Weakly Supervised Object Detection
    • Multi-Scale Training/Val/Test
    • GPU accelerate
  • Context-based Detection

    • Local context
    • Global context
    • Context interactive
  • Neural Architecture Search

    • Stacking more pyramid networks
    • Adding feature dimension
    • Adopting high capacity architecture
  • Efficient post-processing methods

    • Non maximum suppression (NMS)
    • Soft-NMS
  • Deformable convolutional networks

  • Multi-task joint learning and optimization

    • Object detection
    • Semantic segmentation
    • Instance segmentation
    • Edge detection
    • Highlight detection
  • Establish small object datasets

Performance table

FPS(Speed) index is related to the hardware spec(e.g. CPU, GPU, RAM, etc), so it is hard to make an equal comparison. The solution is to measure the performance of all models on hardware with equivalent specifications, but it is very difficult and time consuming.

DetectorCOCO (mAP@IoU=0.5:0.95)Published In
R-CNN-CVPR’14
Fast R-CNN19.7ICCV’15
Faster R-CNN21.9NIPS’15
YOLO v1-CVPR’16
SSD31.2ECCV’16
R-FCN29.9NIPS’16
FPN36.2CVPR’17
YOLO v2-CVPR’17
RetinaNet39.1ICCV’17
Mask R-CNN39.8ICCV’17
Soft-NMS40.9ICCV’17
YOLO v333.0arXiv’18
RefineDet41.8CVPR’18
Cascade R-CNN42.8CVPR’ 18
RFBNet-ECCV’18
Softer-NMS-arXiv’ 18
SNIPER43.5NIPS’ 18
M2Det44.2AAAI’19
Libra R-CNN43.0CVPR’19
FSAF44.6CVPR’19
ExtremeNet43.7CVPR’19
CenterNet45.1ICCV’19
FreeAnchor44.8NeurIPS’19
CBNet53.3AAAI’20
YOLOv4-arXiv’20
ATSS50.7CVPR’ 20
Hit-Detector41.4CVPR’ 20
DetectoRS54.7arXiv’20

Performance on MS COCO

MS COCO detection evaluation metrics

2014

2015

2016

2017

2018

2019

  • [M2Det] M2Det: A Single-Shot Object Detector based on Multi-Level Feature Pyramid Network | [AAAI’ 19] |[pdf] [official code - pytorch] Multi-scale Feature Learning

  • [Libra R-CNN] Libra R-CNN: Balanced Learning for Object Detection | [CVPR’ 19] |[pdf] Training Strategy

  • [FSAF] Feature Selective Anchor-Free Module for Single-Shot Object Detection | [CVPR’ 19] |[pdf] Anchor-Free

  • [ExtremeNet] Bottom-up Object Detection by Grouping Extreme and Center Points | [CVPR’ 19] |[pdf] | [official code - pytorch] Instance Segmentation

  • [CenterNet] CenterNet: Keypoint Triplets for Object Detection | [ICCV’ 19] |[pdf] Keypoint-based detector

  • [FreeAnchor] FreeAnchor: Learning to Match Anchors for Visual Object Detection | [NeurIPS’ 19] |[pdf] Anchor-Free

2020

  • [CBnet] Cbnet: A novel composite backbone network architecture for object detection | [AAAI’ 20] |[pdf] Composite Backbone Network
  • [YOLOv4] YOLOv4: Optimal Speed and Accuracy of Object Detection | [arXiv’ 20] |[pdf]
    • Input: Mosaic data augmentation, Cross mini-Batch Normalization (CmBN), Self-adversarial-training (SAT)
    • BackBone: CSPDarknet53, Mish-activation, DropBlock regularization
    • Neck: SPP block, PAN (path-aggregation block)
    • Prediction: CIoU-loss, DIoU-NMS
  • [ATSS] Bridging the Gap Between Anchor-Based and Anchor-Free Detection via Adaptive Training Sample Selection | [CVPR’ 20] |[pdf] Anchor-Based Training Strategy
  • [Hit-Detector] Hit-Detector: Hierarchical Trinity Architecture Search for Object Detection | [CVPR’ 20] |[pdf] Neural Architecture Search
  • [DetectoRS] DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous Convolution | [arXiv’ 20] |[pdf] Recursive Feature Pyramid Switchable Atrous Convolution Instance Segmentation

Survey

  • Recent advances in small object detection based on deep learning: A review [pdf]
  • A Survey of Deep Learning-based Object Detection [pdf]
  • Object Detection in 20 Y ears: A Survey [pdf]
  • Recent Advances in Deep Learning for Object Detection [pdf]

Analyze Tools

  • 1
    点赞
  • 3
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值