Computer Vision: Traditional Approaches to Image Segmentation, Face Detection, and Pedestrian Detection

Image Segmentation

Segmentation relies on two properties of gray-level values: discontinuity and similarity.

Threshold-based segmentation: pixels below a chosen threshold are treated as background, pixels above it as foreground.
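As a minimal sketch of this idea (assuming Python with OpenCV; the file name and the fixed threshold of 127 are placeholders), cv2.threshold performs the split, and Otsu's method can pick the threshold automatically from the histogram:

```python
import cv2

# Read the image as grayscale (file name is a placeholder)
img = cv2.imread("img.png", cv2.IMREAD_GRAYSCALE)

# Fixed threshold: pixels above 127 become foreground (255), the rest background (0)
_, fg_fixed = cv2.threshold(img, 127, 255, cv2.THRESH_BINARY)

# Otsu's method chooses the threshold automatically from the gray-level histogram
otsu_t, fg_otsu = cv2.threshold(img, 0, 255, cv2.THRESH_BINARY + cv2.THRESH_OTSU)
print("Otsu threshold:", otsu_t)
```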

Edge-based segmentation: find the set of pixels that lie on the boundary between two regions.
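A common way to obtain such boundary pixels is an edge detector such as Canny; this is only a sketch, and the two hysteresis thresholds below are illustrative values that usually need tuning per image:

```python
import cv2

img = cv2.imread("img.png", cv2.IMREAD_GRAYSCALE)  # placeholder file name

# Canny marks pixels that lie on region boundaries; 100/200 are the low/high
# hysteresis thresholds and are only example values.
edges = cv2.Canny(img, 100, 200)
```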

Region-based segmentation:

Region growing: start from a seed region and gradually merge neighboring pixels into it; a pixel can no longer join the region once the difference between its gray value and the region mean exceeds the maximum allowed gray-level distance.
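A small sketch of this criterion (the function name, the seed, and max_dist=10 are all illustrative; 4-connectivity is assumed):

```python
import numpy as np

def region_grow(gray, seed, max_dist=10):
    """Grow a region from `seed` (row, col): a 4-connected neighbor joins
    the region only while |gray value - current region mean| <= max_dist."""
    h, w = gray.shape
    mask = np.zeros((h, w), dtype=bool)
    mask[seed] = True
    stack = [seed]
    total, count = float(gray[seed]), 1   # running sum/count -> region mean
    while stack:
        r, c = stack.pop()
        for nr, nc in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)):
            if 0 <= nr < h and 0 <= nc < w and not mask[nr, nc]:
                if abs(float(gray[nr, nc]) - total / count) <= max_dist:
                    mask[nr, nc] = True
                    total += float(gray[nr, nc])
                    count += 1
                    stack.append((nr, nc))
    return mask
```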

Watershed algorithm: convert the color image to grayscale, compute the gradient map, then run the watershed algorithm on the gradient map to obtain the edge lines of the image.
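In OpenCV the marker-based cv2.watershed takes the color image directly and performs the gradient flooding internally, so a hedged sketch along the lines of the standard OpenCV tutorial looks like the following; the file name, the morphology iterations and the 0.7 distance-transform factor are illustrative assumptions:

```python
import cv2
import numpy as np

img = cv2.imread("img.png")                       # placeholder color image
gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)

# Rough foreground/background split used only to build the markers
_, binary = cv2.threshold(gray, 0, 255, cv2.THRESH_BINARY_INV + cv2.THRESH_OTSU)
kernel = np.ones((3, 3), np.uint8)
opening = cv2.morphologyEx(binary, cv2.MORPH_OPEN, kernel, iterations=2)
sure_bg = cv2.dilate(opening, kernel, iterations=3)

# Sure foreground = pixels far from any background pixel (distance-transform peaks)
dist = cv2.distanceTransform(opening, cv2.DIST_L2, 5)
_, sure_fg = cv2.threshold(dist, 0.7 * dist.max(), 255, 0)
sure_fg = np.uint8(sure_fg)
unknown = cv2.subtract(sure_bg, sure_fg)

# Label the markers; 0 means "to be decided by the flooding"
_, markers = cv2.connectedComponents(sure_fg)
markers = markers + 1
markers[unknown == 255] = 0

# Flood from the markers; pixels on the watershed (edge) lines get label -1
markers = cv2.watershed(img, markers)
edge_lines = (markers == -1)
```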

Graph Cut segmentation: a segmentation method based on graph theory.

GrabCut segmentation: foreground and background are each modeled with a Gaussian Mixture Model (GMM); a boundary term penalizes discontinuities between neighboring pixels, and the GMM parameters are initialized with K-means and refined iteratively.
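A minimal cv2.grabCut sketch (the file name, the bounding rectangle and the 5 iterations are placeholder assumptions; the 1x65 arrays are the GMM state the OpenCV API expects):

```python
import cv2
import numpy as np

img = cv2.imread("img.png")                       # placeholder color image
mask = np.zeros(img.shape[:2], np.uint8)

# GMM state for background and foreground, as required by the OpenCV API
bgd_model = np.zeros((1, 65), np.float64)
fgd_model = np.zeros((1, 65), np.float64)

# Rectangle roughly enclosing the object (coordinates are illustrative)
rect = (50, 50, 300, 400)
cv2.grabCut(img, mask, rect, bgd_model, fgd_model, 5, cv2.GC_INIT_WITH_RECT)

# Sure and probable foreground labels form the final segmentation mask
fg = np.where((mask == cv2.GC_FGD) | (mask == cv2.GC_PR_FGD), 1, 0).astype(np.uint8)
result = img * fg[:, :, None]
```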

Face Detection

Haar-like feature templates: the drawback is that a huge number of features is needed, coming from different templates (15 kinds), different positions, and different scales.
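To make the feature count concrete, here is a sketch of one such template (a two-rectangle, top-minus-bottom feature) evaluated with an integral image; the helper names are hypothetical, and enumerating templates like this over all positions and scales is exactly what blows up the number of features:

```python
import numpy as np

def integral_image(gray):
    # Summed-area table with an extra zero row/column for easy indexing
    return np.pad(gray.astype(np.int64), ((1, 0), (1, 0))).cumsum(0).cumsum(1)

def rect_sum(ii, r, c, h, w):
    # Sum of the h x w rectangle whose top-left corner is (r, c), in O(1)
    return ii[r + h, c + w] - ii[r, c + w] - ii[r + h, c] + ii[r, c]

def haar_two_rect_vertical(ii, r, c, h, w):
    # One Haar-like template: bright upper half minus dark lower half
    half = h // 2
    return rect_sum(ii, r, c, half, w) - rect_sum(ii, r + half, c, half, w)
```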

Haar cascade classifier: several strong classifiers are chained together, and each strong classifier is composed of multiple weak classifiers.
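With the pretrained frontal-face cascade that ships with the opencv-python package, detection is a few lines (the image path, scaleFactor and minNeighbors are typical placeholder values):

```python
import cv2

# Frontal-face cascade bundled with opencv-python; image path is a placeholder
cascade_path = cv2.data.haarcascades + "haarcascade_frontalface_default.xml"
face_cascade = cv2.CascadeClassifier(cascade_path)

gray = cv2.imread("photo.jpg", cv2.IMREAD_GRAYSCALE)
# scaleFactor / minNeighbors are common defaults and usually need tuning
faces = face_cascade.detectMultiScale(gray, scaleFactor=1.1, minNeighbors=5)
for (x, y, w, h) in faces:
    print("face at", x, y, w, h)
```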

Pedestrian Detection

HOG (Histogram of Oriented Gradients): for color images, at each pixel take the channel with the largest gradient magnitude.
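A small NumPy/OpenCV sketch of this per-pixel channel selection (the file name is a placeholder, and the plain [-1, 0, 1] derivative kernel matches the one commonly used for HOG):

```python
import cv2
import numpy as np

img = cv2.imread("person.png").astype(np.float32)     # placeholder color image

# Per-channel gradients with a plain [-1, 0, 1] kernel (ksize=1, no smoothing)
gx = cv2.Sobel(img, cv2.CV_32F, 1, 0, ksize=1)
gy = cv2.Sobel(img, cv2.CV_32F, 0, 1, ksize=1)
mag = np.sqrt(gx ** 2 + gy ** 2)                      # shape (H, W, 3)

# For every pixel keep the gradient of the channel with the largest magnitude
best = mag.argmax(axis=2)
rows, cols = np.indices(best.shape)
gx_max = gx[rows, cols, best]
gy_max = gy[rows, cols, best]
angle = np.rad2deg(np.arctan2(gy_max, gx_max)) % 180  # unsigned orientation
```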

Block partitioning: 16x16 blocks with a stride of 8 (50% overlap); each block contains 2x2 cells, and each cell is 8x8 pixels.

HOG pipeline (a pedestrian-detection sketch using OpenCV's HOGDescriptor follows the steps below):

Apply gamma-space and color normalization to the whole image.

Compute the image gradients.

Build the orientation histograms.

Group the cells into larger blocks.

Collect the HOG features.
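OpenCV's HOGDescriptor already uses exactly the layout described above (64x128 detection window, 16x16 blocks with stride 8, 8x8 cells, 9 orientation bins), so a hedged pedestrian-detection sketch with the default people detector looks like this; the image path and winStride are placeholders:

```python
import cv2

# Default descriptor: 64x128 window, 16x16 blocks, stride 8, 8x8 cells, 9 bins
hog = cv2.HOGDescriptor()
hog.setSVMDetector(cv2.HOGDescriptor_getDefaultPeopleDetector())

img = cv2.imread("street.jpg")                        # placeholder image
rects, weights = hog.detectMultiScale(img, winStride=(8, 8))
for (x, y, w, h) in rects:
    print("pedestrian at", x, y, w, h)

# The raw 3780-dimensional HOG feature of a single 64x128 window:
feat = hog.compute(cv2.resize(img, (64, 128)))
```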

 

 

Human parsing has been extensively studied recently (Yamaguchi et al. 2012; Xia et al. 2017) due to its wide applications in many important scenarios. Mainstream fashion parsing models (i.e., parsers) focus on parsing high-resolution and clean images. However, directly applying parsers trained on benchmarks of high-quality samples to a particular application scenario in the wild, e.g., a canteen, airport or workplace, often gives non-satisfactory performance due to domain shift. In this paper, we explore a new and challenging cross-domain human parsing problem: taking the benchmark dataset with extensive pixel-wise labeling as the source domain, how to obtain a satisfactory parser on a new target domain without requiring any additional manual labeling? To this end, we propose a novel and efficient cross-domain human parsing model to bridge the cross-domain differences in terms of visual appearance and environment conditions and fully exploit commonalities across domains. Our proposed model explicitly learns a feature compensation network, which is specialized for mitigating the cross-domain differences. A discriminative feature adversarial network is introduced to supervise the feature compensation and effectively reduce the discrepancy between the feature distributions of the two domains. Besides, our proposed model also introduces a structured label adversarial network to guide the parsing results of the target domain to follow the high-order relationships of the structured labels shared across domains. The proposed framework is end-to-end trainable, practical and scalable in real applications. Extensive experiments are conducted where the LIP dataset is the source domain and 4 different datasets, including surveillance videos, movies and runway shows without any annotations, are evaluated as target domains. The results consistently confirm the data efficiency and performance advantages of the proposed method for the challenging cross-domain human parsing problem.

This paper presents a robust Joint Discriminative appearance model based Tracking method using online random forests and mid-level features (superpixels). To achieve superpixel-wise discriminative ability, we propose a joint appearance model that consists of two random-forest-based models, i.e., the Background-Target discriminative Model (BTM) and the Distractor-Target discriminative Model (DTM). More specifically, the BTM effectively learns discriminative information between the target object and the background. In contrast, the DTM is used to suppress distracting superpixels, which significantly improves the tracker's robustness and alleviates the drifting problem. A novel online random forest regression algorithm is proposed to build the two models. The BTM and DTM are linearly combined into a joint model to compute a confidence map. Tracking results are estimated using the confidence map, where the position and scale of the target are estimated in order. Furthermore, we design a model updating strategy to adapt to appearance changes over time by discarding degraded trees of the BTM and DTM and initializing new trees as replacements. We test the proposed tracking method on two large tracking benchmarks, the CVPR2013 tracking benchmark and the VOT2014 tracking challenge. Experimental results show that the tracker runs at real-time speed and achieves favorable tracking performance compared with state-of-the-art methods. The results also suggest that the DTM improves tracking performance significantly and plays an important role in robust tracking.