300+ Awesome Computer Vision Resources

A curated collection of 300+ awesome computer vision resources including books, courses, papers, tutorials, software and more.

Due to the size of this list, it can be hard to keep up with broken links, so if you come across any, please let me know in the comments section.

Also, if you know of any awesome computer vision resources that are missing from this list, please let me know in the comments section.

TABLE OF CONTENTS

BOOKS

COMPUTER VISION
OPENCV PROGRAMMING
MACHINE LEARNING
FUNDAMENTALS

COURSES

COMPUTER VISION
COMPUTATIONAL PHOTOGRAPHY
MACHINE LEARNING AND STATISTICAL LEARNING
OPTIMIZATION

PAPERS

CONFERENCE PAPERS ON THE WEB
SURVEY PAPERS

TUTORIALS AND TALKS

COMPUTER VISION
CONFERENCE TALKS
3D COMPUTER VISION
INTERNET VISION
COMPUTATIONAL PHOTOGRAPHY
LEARNING AND VISION
OBJECT RECOGNITION
GRAPHICAL MODELS
MACHINE LEARNING
OPTIMIZATION
DEEP LEARNING

SOFTWARE

EXTERNAL RESOURCE LINKS
GENERAL PURPOSE COMPUTER VISION LIBRARY
MULTIPLE-VIEW COMPUTER VISION
FEATURE DETECTION AND EXTRACTION
  • VLFeat
  • SIFT – David G. Lowe, “Distinctive image features from scale-invariant keypoints,” International Journal of Computer Vision, 60, 2 (2004), pp. 91-110.
  • SIFT++
  • BRISK – Stefan Leutenegger, Margarita Chli and Roland Siegwart, “BRISK: Binary Robust Invariant Scalable Keypoints”, ICCV 2011
  • SURF – Herbert Bay, Andreas Ess, Tinne Tuytelaars, Luc Van Gool, “SURF: Speeded Up Robust Features”, Computer Vision and Image Understanding (CVIU), Vol. 110, No. 3, pp. 346–359, 2008
  • FREAK – A. Alahi, R. Ortiz, and P. Vandergheynst, “FREAK: Fast Retina Keypoint”, CVPR 2012
  • KAZE / AKAZE – Pablo F. Alcantarilla, Adrien Bartoli and Andrew J. Davison, “KAZE Features”, ECCV 2012
  • Local Binary Patterns
HIGH DYNAMIC RANGE IMAGING
SEMANTIC SEGMENTATION
LOW-LEVEL VISION
STEREO VISION
OPTICAL FLOW
SUPER-RESOLUTION
  • Multi-frame image super-resolution – Pickup, L. C. Machine Learning in Multi-frame Image Super-resolution, PhD thesis 2008
  • Markov Random Fields for Super-Resolution – W. T. Freeman and C. Liu. Markov Random Fields for Super-resolution and Texture Synthesis. In A. Blake, P. Kohli, and C. Rother, eds., Advances in Markov Random Fields for Vision and Image Processing, Chapter 10. MIT Press, 2011
  • Sparse regression and natural image prior – K. I. Kim and Y. Kwon, “Single-image super-resolution using sparse regression and natural image prior”, IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 32, no. 6, pp. 1127-1133, 2010.
  • Single-Image Super Resolution via a Statistical Model – T. Peleg and M. Elad, A Statistical Prediction Model Based on Sparse Representations for Single Image Super-Resolution, IEEE Transactions on Image Processing, Vol. 23, No. 6, Pages 2569-2582, June 2014
  • Sparse Coding for Super-Resolution – R. Zeyde, M. Elad, and M. Protter, On Single Image Scale-Up using Sparse-Representations, Curves & Surfaces, Avignon, France, June 24-30, 2010 (also appears in Lecture Notes in Computer Science, LNCS).
  • Patch-wise Sparse Recovery – Jianchao Yang, John Wright, Thomas Huang, and Yi Ma. Image super-resolution via sparse representation. IEEE Transactions on Image Processing (TIP), vol. 19, issue 11, 2010.
  • Neighbor embedding – H. Chang, D.Y. Yeung, Y. Xiong. Super-resolution through neighbor embedding. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), vol.1, pp.275-282, Washington, DC, USA, 27 June – 2 July 2004.
  • Deformable Patches – Yu Zhu, Yanning Zhang and Alan Yuille, Single Image Super-resolution using Deformable Patches, CVPR 2014
  • SRCNN – Chao Dong, Chen Change Loy, Kaiming He, Xiaoou Tang, Learning a Deep Convolutional Network for Image Super-Resolution, in ECCV 2014
  • A+: Adjusted Anchored Neighborhood Regression – R. Timofte, V. De Smet, and L. Van Gool. A+: Adjusted Anchored Neighborhood Regression for Fast Super-Resolution, ACCV 2014
  • Transformed Self-Exemplars – Jia-Bin Huang, Abhishek Singh, and Narendra Ahuja, Single Image Super-Resolution using Transformed Self-Exemplars, IEEE Conference on Computer Vision and Pattern Recognition, 2015
IMAGE DEBLURRING

Non-blind deconvolution

Blind deconvolution

Non-uniform Deblurring

IMAGE COMPLETION
IMAGE RETARGETING
ALPHA MATTING
IMAGE PYRAMID
EDGE-PRESERVING IMAGE PROCESSING
INTRINSIC IMAGES
CONTOUR DETECTION AND IMAGE SEGMENTATION
INTERACTIVE IMAGE SEGMENTATION
VIDEO SEGMENTATION
CAMERA CALIBRATION
SIMULTANEOUS LOCALIZATION AND MAPPING
SLAM COMMUNITY:
TRACKING/ODOMETRY:
GRAPH OPTIMIZATION:
LOOP CLOSURE:
LOCALIZATION & MAPPING:
SINGLE-VIEW SPATIAL UNDERSTANDING
OBJECT DETECTION
NEAREST NEIGHBOR SEARCH
GENERAL PURPOSE NEAREST NEIGHBOR SEARCH
NEAREST NEIGHBOR FIELD ESTIMATION
VISUAL TRACKING
IMAGE CAPTIONING
  • NeuralTalk – NeuralTalk is a Python+numpy project for learning Multimodal Recurrent Neural Networks that describe images with sentences.
OPTIMIZATION
  • Ceres Solver – Nonlinear least-squares and unconstrained optimization solver
  • NLopt – Nonlinear optimization library for constrained and unconstrained problems
  • OpenGM – Factor graph based discrete optimization and inference solver
  • GTSAM – Factor graph based least-squares optimization solver
MACHINE LEARNING

DATASETS

EXTERNAL DATASET LINK COLLECTION
LOW-LEVEL VISION
STEREO VISION
OPTICAL FLOW
IMAGE SUPER-RESOLUTION
INTRINSIC IMAGES
MATERIAL RECOGNITION
MULTI-VIEW RECONSTRUCTION
VISUAL TRACKING
VISUAL SURVEILLANCE
CHANGE DETECTION
VISUAL RECOGNITION
IMAGE CLASSIFICATION
SCENE RECOGNITION
OBJECT DETECTION
SEMANTIC LABELING
MULTI-VIEW OBJECT DETECTION
FINE-GRAINED VISUAL RECOGNITION
PEDESTRIAN DETECTION
ACTION RECOGNITION
VIDEO-BASED
IMAGE DEBLURRING
IMAGE CAPTIONING
SCENE UNDERSTANDING
  • SUN RGB-D – An RGB-D Scene Understanding Benchmark Suite
  • NYU Depth v2 – Indoor Segmentation and Support Inference from RGBD Images

RESOURCES FOR STUDENTS

RESOURCE LINK COLLECTION
WRITING
PRESENTATION
RESEARCH
TIME MANAGEMENT

LINKS

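├─1. Introduction to Computer Vision and Environment Setup (python, ipython)
│      computer vsion.pdf
│      CS231 introduction.pdf
│
├─2. Introduction to Image Classification, kNN Classifier, Linear Classifiers, Model Selection
│      2. Introduction to Image Classification, kNN and Linear Classifiers, Model Selection.mp4
│      2.A First Look at Image Classification.pdf
│
├─3. Linear Classifiers Revisited
│      3.Linear Classifiers Revisited.mp4
│      Linear Classifiers Revisited.pdf
│
├─4. Introduction to Backpropagation and Neural Networks
│      .Introduction to Backpropagation and Neural Networks.pdf
│      4. Introduction to Backpropagation and Neural Networks.mp4
│
├─5. Neural Network Training 1
│      5.-Neural Network Training 1.pdf
│      5.Neural Network Training 1.mp4
│
├─6. Neural Network Training 2, Introduction to Convolutional Neural Networks
│      6.Neural Network Training 2.mp4
│      Neural Network Training 2.pdf
│
├─7. Convolutional Neural Networks
│      7.Convolutional Neural Networks.mp4
│      Lession7.pdf
│
├─8. Image OCR: Review, Progress, and Application Prospects
│      8.Image OCR: Review, Progress, and Application Prospects.mp4
│      PhotoOCR_xbai.pdf
│
└─9. Object Localization and Detection
│      Object Localization and Detection.pdf
│
├─10. Visualizing Convolutional Neural Networks
│      .Visualizing Convolutional Neural Networks.pdf
│      10.Visualizing Convolutional Neural Networks.mp4
│
├─11. Recurrent Neural Networks and Their Applications
│      11.Recurrent Neural Networks and Their Applications.mp4
│      Recurrent Neural Networks.pdf
│
├─12. Convolutional Neural Networks in Practice
│      12.Convolutional Neural Network Training in Practice.mp4
│      Convolutional Neural Networks in Practice.pdf
│
├─13. Overview of Common Deep Learning Frameworks
│      Overview of Common Deep Learning Frameworks.pdf
│
├─14. Image Segmentation
│      14.Image Segmentation.mp4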