目标检测相关文章及源码

【目标识别】深度学习进行目标识别的资源列表:O网页链接 包括RNN、MultiBox、SPP-Net、DeepID-Net、Fast R-CNN、DeepBox、MR-CNN、Faster R-CNN、YOLO、DenseBox、SSD、Inside-Outside Net、G-CNN等。
Papers
Deep Neural Networks for Object Detection
OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks
[td]
method
ILSVRC 2013 mAP
OverFeat
24.3%
R-CNN
Rich feature hierarchies for accurate object detection and semantic segmentation(R-CNN)
[td]
method
VOC 2007 mAP
VOC 2010 mAP
VOC 2012 mAP
ILSVRC 2013 mAP
R-CNN,AlexNet
54.2%
50.2%
49.6%

R-CNN,bbox reg,AlexNet
58.5%
53.7%
53.3%
31.4%
R-CNN,bbox reg,ZFNet
59.2%

 
R-CNN,VGG-Net
62.2%

 
R-CNN,bbox reg,VGG-Net
66.0%

 
MultiBox
Scalable Object Detection using Deep Neural Networks (MultiBox)
SPP-Net
Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition
[td]
method
VOC 2007 mAP
ILSVRC 2013 mAP
SPP_net(ZF-5),1-model
54.2%
31.84%
SPP_net(ZF-5),2-model
60.9%

SPP_net(ZF-5),6-model
35.11%
Learning Rich Features from RGB-D Images for Object Detection and Segmentation
Scalable, High-Quality Object Detection
DeepID-Net
DeepID-Net: Deformable Deep Convolutional Neural Networks for Object Detection
[td]
method
VOC 2007 mAP
ILSVRC 2013 mAP
DeepID-Net
64.1%
50.3%
Object Detection Networks on Convolutional Feature Maps
[td]
method
Trained on
mAP
NoC
07+12
68.8%
NoC,bb
07+12
71.6%
NoC,+EB
07+12
71.8%
NoC,+EB,bb
07+12
73.3%
Improving Object Detection with Deep Convolutional Networks via Bayesian Optimization and Structured Prediction
[td]
Model
BBoxReg?
VOC 2007 mAP(IoU>0.5)
R-CNN(AlexNet)
No
54.2%
R-CNN(VGG)
No
60.6%
+StructObj
No
61.2%
+StructObj-FT
No
62.3%
+FGS
No
64.8%
+StructObj+FGS
No
65.9%
+StructObj-FT+FGS
No
66.5%
[td]
Model
BBoxReg?
VOC 2007 mAP(IoU>0.5)
R-CNN(AlexNet)
Yes
58.5%
R-CNN(VGG)
Yes
65.4%
+StructObj
Yes
66.6%
+StructObj-FT
Yes
66.9%
+FGS
Yes
67.2%
+StructObj+FGS
Yes
68.5%
+StructObj-FT+FGS
Yes
68.4%
Fast R-CNN
Fast R-CNN
[td]
method
data
VOC 2007 mAP
FRCN,VGG16
07
66.9%
FRCN,VGG16
07+12
70.0%
[td]
method
data
VOC 2010 mAP
FRCN,VGG16
12
66.1%
FRCN,VGG16
07++12
68.8%
[td]
method
data
VOC 2012 mAP
FRCN,VGG16
12
65.7%
FRCN,VGG16
07++12
68.4%
DeepBox
DeepBox: Learning Objectness with Convolutional Networks
MR-CNN
Object detection via a multi-region & semantic segmentation-aware CNN model (MR-CNN)
[td]
Model
Trained on
VOC 2007 mAP
VGG-net
07+12
78.2%
VGG-net
07
74.9%
[td]
Model
Trained on
VOC 2012 mAP
VGG-net
07+12
73.9%
VGG-net
12
70.7%
Faster R-CNN
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks(NIPS 2015)
[td]

training data
test data
mAP
time/img
Faster RCNN, VGG-16
07
VOC 2007 test
69.9%
198ms
Faster RCNN, VGG-16
07+12
VOC 2007 test
73.2%
198ms
Faster RCNN, VGG-16
12
VOC 2007 test
67.0%
198ms
Faster RCNN, VGG-16
07++12
VOC 2007 test
70.4%
198ms
YOLO
You Only Look Once: Unified, Real-Time Object Detection(YOLO)
R-CNN minus R
DenseBox
DenseBox: Unifying Landmark Localization with End to End Object Detection
SSD
SSD: Single Shot MultiBox Detector
Inside-Outside Net
Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks
Detection results on VOC 2007 test:
[td]
Method
R
S
W
D
Train
mAP
FRCN

 
 
07+12
70.0
RPN

 
 
07+12
73.2
MR-CNN

 

07+12
78.2
ION

 
 
07+12
74.6
ION

 
07+12
75.6
ION

 
07+12+S
76.5
ION

07+12+S
78.5
ION
07+12+S
79.2
Detection results on VOC 2012 test:
[td]
Method
R
S
W
D
Train
mAP
FRCN

 
 
07++12
68.4
RPN

 
 
07++12
70.4
FRCN+YOLO

 
 
07++12
70.4
HyperNet

 
 
07++12
71.4
MR-CNN

 

07+12
73.9
ION
07+12+S
76.4
G-CNN
G-CNN: an Iterative Grid Based Object Detector
Learning Deep Features for Discriminative Localization
Factors in Finetuning Deep Model for object detection
We don’t need no bounding-boxes: Training object class detectors using only human verification
A MultiPath Network for Object Detection
Beyond Bounding Boxes: Precise Localization of Objects in Images (PhD Thesis)
T-CNN: Tubelets with Convolutional Neural Networks for Object Detection from Videos
Training Region-based Object Detectors with Online Hard Example Mining
Specific Object Deteciton
End-to-end people detection in crowded scenes
Tutorials
Convolutional Feature Maps: Elements of efficient (and accurate) CNN-based object detection
Codes
TensorBox: a simple framework for training neural networks to detect objects in images
Object detection in torch: Implementation of some object detection frameworks in torch
Blogs
Convolutional Neural Networks for Object Detection
import cv2 as cv import numpy as np def scan_edge_demo(img): gray = cv.cvtColor(img,cv.COLOR_BGR2GRAY) gray = cv.GaussianBlur(gray,(3,3),0) scan_edge = cv.Canny(gray,60,150) return scan_edge def scan_contours(img): scan_edge =scan_edge_demo(img) aa,contours,b= cv.findContours(scan_edge,cv.RETR_TREE,cv.CHAIN_APPROX_SIMPLE) cv.drawContours(img,contours,-1,(0,255,255),thickness=4) cv.imshow("scan_contours",img) src = cv.imread("E:/opencv/picture/taijie.png") cv.imshow("inital_window",src) scan_contours(src) cv.waitKey(0) cv.destroyAllWindows() 图片: 分析: 1.Opencv发现轮廓的函数原型为:findContours(image, mode, method[, contours[, hierarchy[, offset]]]) -> image, contours, hierarchy image参数表示8位单通道图像矩阵,可以是灰度图,但更常用的是二值图像,一般是经过Canny、拉普拉斯等边缘检测算子处理过的二值图像。 所以输入源需要二值化(threshold)处理或者边缘处理canny后才行 mode参数表示轮廓检索模式: ①CV_RETR_EXTERNAL:只检测最外围轮廓,包含在外围轮廓内的内围轮廓被忽略。 ②CV_RETR_LIST:检测所有的轮廓,包括内围、外围轮廓,但是检测到的轮廓不建立等级关系,彼此之间独立,没有等级关系,这就意味着这个检索模式下不存在父轮廓或内嵌轮廓。 ③CV_RETR_CCOMP:检测所有的轮廓,但所有轮廓只建立两个等级关系,外围为顶层,若外围内的内围轮廓还包含了其他的轮廓信息,则内围内的所有轮廓均归属于顶层。 ④CV_RETR_TREE:检测所有轮廓,所有轮廓建立一个等级树结构,外层轮廓包含内层轮廓,内层轮廓还可以继续包含内嵌轮廓。 method参数表示轮廓的近似方法: ①CV_CHAIN_APPROX_NONE 存储所有的轮廓点,相邻的两个点的像素位置差不超过1,即max (abs (x1 - x2), abs(y2 - y1) == 1。 ②CV_CHAIN_APPROX_SIMPLE压缩水平方向,垂直方向,对角线方向的元素,只保留该方向的终点坐标,例如一个矩形轮廓只需4个点来保存轮廓信息。 ③CV_CHAIN_APPROX_TC89_L1,CV_CHAIN_APPROX_TC89_KCOS使用teh-Chinl chain 近似算法。 contours参数是一个list,表示存储的每个轮廓的点集合。 hierarchy参数是一个list,list中元素个数和轮廓个数相同,每个轮廓contours[i]对应4个hierarchy元素hierarchy[i][0] ~hierarchy[i][3],分别表示后一个轮廓、前一个轮廓、父轮廓、内嵌轮廓的索引编号,如果没有对应项,则该值为负数。 offset参数表示每个轮廓点移动的可选偏移量。 2.Opencv绘制轮廓的函数原型为:drawContours(image, contours, contourIdx, color[, thickness[, lineType[, hierarchy[, maxLevel[, offset]]]]]) -> image imgae参数表示目标图像。 contours参数表示所有输入轮廓。 contourIdx参数表示绘制轮廓list中的哪条轮廓, 如果是负数,则绘制所有轮廓。 color参数表示轮廓的颜色。 thickness参数表示绘制的轮廓线条粗细,如果是负数,则绘制轮廓内部。 lineType参数表示线型。 hierarchy参数表示有关层次结构的可选信息。 maxLevel参数表示绘制轮廓的最大级别。 如果为0,则仅绘制指定的轮廓。 如果为1,则该函数绘制轮廓和所有嵌套轮廓。 如果为2,则该函数绘制轮廓,所有嵌套轮廓,所有嵌套到嵌套的轮廓,等等。 仅当有可用的层次结构时才考虑此参数。 offset参数表示可选的轮廓偏移参数,该参数可按指定的方式移动所有绘制的轮廓。 关于轮廓检测,什么的样的情况会被判断为轮廓呢? 答:因为在做轮廓检测之前需要进行二值化,所以对于图像的整个ROI区域只有黑白两个颜色,而下面两种情况会被检测作为轮廓: 1. 白色区域与黑色区域的边缘交接区域 2. 当背景为白色时,整个ROI区域的外边界就会被视为轮廓。(往往我们希望背景是黑色,所以如果出现这种情况时我们需要在二值化图像时对图像取反)。
评论 3
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值