READING NOTE: Chained Predictions Using Convolutional Neural Networks

最新推荐文章于 2019-03-04 22:11:16 发布

Joshua_Li_

最新推荐文章于 2019-03-04 22:11:16 发布

阅读量1.1k

点赞数

分类专栏：计算机视觉

本文链接：https://blog.csdn.net/joshua_1988/article/details/51493128

版权

计算机视觉专栏收录该内容

72 篇文章 0 订阅

订阅专栏

TITLE: Chained Predictions Using Convolutional Neural Networks

AUTHER: Georgia Gkioxari, Alexander Toshev, Navdeep Jaitly

ASSOCIATION: UC Berkeley, Google

FROM: arXiv:1605.02346

CONTRIBUTIONS

A chain model for structured outputs, such as human pose estimation. The output convolutional neural networks is a multiscale deconvolution that we called deception because of its relationship to deconvolution and inception models.
Two formulations of the chain model is proposed. One is without weight sharing between different predictors (poses in images) and the other is with weight sharing (poses in videos).

METHOD

There are two formulations of the chain model in this work. The one used for single image is taken as an example here. It is a similar procedure in video version.

The inference stage is illustrated in the figure. The input is the image and the image is first fed to a CNN denoted as CNNx. For every stage, a joint of the person is localized by a CNN denoted as CNNy, denoted as “Predictio@0”. Then both the input and output of CNNy is used to predict next joint in the next stage. The procedure can be formalized as:

h t = σ (w h t * h t - 1 + \sum i = 0 t - 1 w y i, t * e (y i))

$h_t=\sigma(w_t^h \ast h_{t-1}+\sum_{i=0}^{t-1}w_{i,t}^y \ast e(y_i))$

P (Y t = y t | X, y 0, . . ., y t - 1) = S o f t m a x (m t (h t))

$P(Y_t=y_t|X,y_0,...,y_{t-1})=Softmax(m_t(h_t))$

where $h_0$ =CNNx(x), $e(\cdot)$ is a full neural net, $m_t$ is the operation of CNNy on $h_t$ , and $P$ is the probability of the location of a joint.

ADVANTAGES

Using chain models allows us to sidestep any assumptions about the joint distribution of the output variables.
Jointly considering other structures can lead to better performance.
Hand-crafted features are replaced by CNN, which can be learnt end-to-end.

DISADVANTAGES

$e(\cdot)$ is not explained in this work.

Joshua_Li_

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
READING NOTE: Chained Predictions Using Convolutional Neural Networks

TITLE: Chained Predictions Using Convolutional Neural Networks
复制链接

扫一扫

专栏目录