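Classic Papers
- ImageNet Classification
- Object Detection
- Object Tracking
- Low-Level Vision
- Edge Detection
- Semantic Segmentation
- Visual Attention and Saliency
- Object Recognition
- Human Pose Estimation
- Understanding CNN (principles and properties)
- Image and Language
- Image Captioning
- Video Captioning
- Image Generation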
ImageNet Classification
Microsoft ResNet
Paper: Deep Residual Learning for Image Recognition
Authors: Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun
Link: http://arxiv.org/pdf/1512.03385v1.pdf
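Microsoft PReLU (Parametric Rectified Linear Unit / weight initialization)
Paper: Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification
Authors: Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun
Link: http://arxiv.org/pdf/1502.01852.pdf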
Google Batch Normalization
Paper: Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
Authors: Sergey Ioffe, Christian Szegedy
Link: http://arxiv.org/pdf/1502.03167.pdf
Google GoogLeNet
Paper: Going Deeper with Convolutions, CVPR 2015
Authors: Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, Andrew Rabinovich
Link: http://arxiv.org/pdf/1409.4842.pdf
Oxford VGG-Net
Paper: Very Deep Convolutional Networks for Large-Scale Image Recognition, ICLR 2015
Authors: Karen Simonyan, Andrew Zisserman
Link: http://arxiv.org/pdf/1409.1556.pdf
AlexNet
Paper: ImageNet Classification with Deep Convolutional Neural Networks
Authors: Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton
Link: http://papers.nips.cc/paper/4824-imagenet-classification-with-deep-convolutional-neural-networks.pdf
Object Detection
PVANET
Paper: PVANET: Deep but Lightweight Neural Networks for Real-time Object Detection
Authors: Kye-Hyeon Kim, Sanghoon Hong, Byungseok Roh, Yeongjae Cheon, Minje Park
Link: http://arxiv.org/pdf/1608.08021
NYU OverFeat
Paper: OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks, ICLR 2014
Authors: Pierre Sermanet, David Eigen, Xiang Zhang, Michael Mathieu, Rob Fergus, Yann LeCun
Link: http://arxiv.org/pdf/1312.6229.pdf
Berkeley R-CNN
Paper: Rich feature hierarchies for accurate object detection and semantic segmentation, CVPR 2014
Authors: Ross Girshick, Jeff Donahue, Trevor Darrell, Jitendra Malik
Microsoft SPP
Paper: Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition, ECCV 2014
Authors: Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun
Link: http://arxiv.org/pdf/1406.4729.pdf
Microsoft Fast R-CNN
Paper: Fast R-CNN
Author: Ross Girshick
Link: http://arxiv.org/pdf/1504.08083.pdf
Microsoft Faster R-CNN
Paper: Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
Authors: Shaoqing Ren, Kaiming He, Ross Girshick, Jian Sun
Link: http://arxiv.org/pdf/1506.01497.pdf
Oxford R-CNN minus R
Paper: R-CNN minus R
Authors: Karel Lenc, Andrea Vedaldi
Link: http://arxiv.org/pdf/1506.06981.pdf
End-to-end People Detection
Paper: End-to-end People Detection in Crowded Scenes
Authors: Russell Stewart, Mykhaylo Andriluka
Link: http://arxiv.org/pdf/1506.04878.pdf
Real-time Object Detection
Paper: You Only Look Once: Unified, Real-Time Object Detection
Authors: Joseph Redmon, Santosh Divvala, Ross Girshick, Ali Farhadi
Link: http://arxiv.org/pdf/1506.02640.pdf
Inside-Outside Net
Paper: Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks
Authors: Sean Bell, C. Lawrence Zitnick, Kavita Bala, Ross Girshick
Link: http://arxiv.org/abs/1512.04143.pdf
Microsoft ResNet
Paper: Deep Residual Learning for Image Recognition
Authors: Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun
Link: http://arxiv.org/pdf/1512.03385v1.pdf
R-FCN
Paper: R-FCN: Object Detection via Region-based Fully Convolutional Networks
Authors: Jifeng Dai, Yi Li, Kaiming He, Jian Sun
Link: http://arxiv.org/abs/1605.06409
SSD
Paper: SSD: Single Shot MultiBox Detector
Authors: Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, Alexander C. Berg
Link: http://arxiv.org/pdf/1512.02325v2.pdf
Speed/Accuracy Trade-offs
Paper: Speed/accuracy trade-offs for modern convolutional object detectors
Authors: Jonathan Huang, Vivek Rathod, Chen Sun, Menglong Zhu, Anoop Korattikara, Alireza Fathi, Ian Fischer, Zbigniew Wojna, Yang Song, Sergio Guadarrama, Kevin Murphy
Link: http://arxiv.org/pdf/1611.10012v1.pdf
Object Tracking
- Paper: Online Tracking by Learning Discriminative Saliency Map with Convolutional Neural Network
Authors: Seunghoon Hong, Tackgeun You, Suha Kwak, Bohyung Han
Link: arXiv:1502.06796
- Paper: DeepTrack: Learning Discriminative Feature Representations by Convolutional Neural Networks for Visual Tracking
Authors: Hanxi Li, Yi Li, Fatih Porikli
Published: BMVC 2014
- Paper: Learning a Deep Compact Image Representation for Visual Tracking
Authors: N Wang, DY Yeung
Published: NIPS 2013
- Paper: Hierarchical Convolutional Features for Visual Tracking
Authors: Chao Ma, Jia-Bin Huang, Xiaokang Yang, Ming-Hsuan Yang
Published: ICCV 2015
- Paper: Visual Tracking with Fully Convolutional Networks