行人检测--What Can Help Pedestrian Detection?

最新推荐文章于 2024-07-16 13:34:36 发布

O天涯海阁O

最新推荐文章于 2024-07-16 13:34:36 发布

阅读量3.9k

点赞数

分类专栏：人 CVPR2017

本文链接：https://blog.csdn.net/zhangjunhit/article/details/77524559

版权

CVPR2017 同时被 2 个专栏收录

46 篇文章 20 订阅

订阅专栏

人

12 篇文章 0 订阅

订阅专栏

What Can Help Pedestrian Detection?
CVPR2017

本文主要分析 extra features 对于基于CNN 的行人检测有什么帮助，设计了一个行人检测网络 HyperLearner 可以有效利用这些 extra features

这里的 extra features 主要指 various channel features

这里写图片描述

行人检测中存在什么问题了？
这里写图片描述
一个是低分辨率情况的背景和小尺寸的人很难区分，另一个是多个人在一起的时候基于CNN的单个行人的检测定位误差相对较大

这里我们将 channel features 融入到 Faster R-CNN 中去
这里写图片描述

3.2. Introduction to channel features

各种 channel features 分为三类： apparent-to-semantic channels, temporal channels and depth channels
1） apparent-to-semantic channels 主要包括 ICF channel [10]( colors and gradients), edge channel, segmentation channel ( FCN )and heatmap channel( blur the segmentation channel )，特征从 low-level apparent to high-level semantic
2) Temporal channels 包括 optical flow [1] and motion [29]
3）Depth channels disparity channel

这里写图片描述
分割信息对行人检测的帮助最大

HyperLearner 网络
这里写图片描述
这里的 Aggregated activation map 是怎么得到了？将 body network 中不同尺寸的特征图归一化到同一个尺寸、相同通道数目，组合起来得到 Aggregated activation map，这个 Aggregated activation map 作为 channel feature network (CFN) 的输入得到 Channel Feature，body network 的最终输出加上 Aggregated activation map 作为 region proposal network (RPN) and Fast R-CNN (FRCNN) 的输入。