Paper reading: Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields (1)

yengjie

Category: DL pose estimation. Tags: DL pose estimation

Article link: https://blog.csdn.net/yengjie2200/article/details/68064095

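This post covers a real-time multi-person 2D pose estimation method that takes a bottom-up approach and uses Part Affinity Fields (PAFs) to exploit global context. PAFs are 2D vector fields that encode the location and orientation of limbs; together with the confidence maps, they are jointly learned and predicted through a sequential prediction framework. For part detection and association, PAFs allow an efficient algorithm based on greedy association over a minimum spanning tree, without significantly degrading the quality of the pose estimates.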

The authors provide runnable code for this paper at https://github.com/CMU-Perceptual-Computing-Lab/caffe_rtpose .

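What follows is my own understanding of the paper. If anything is wrong, discussion is welcome. Please credit the source if you repost.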

1. Introduction

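Challenges of pose estimation:

1) The number of people in the image, their positions, and their scales are all unknown.

2) Contact and occlusion between people make the problem more complex.

3) Real-time requirements: the more people in the image, the higher the computational complexity.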

A common approach: person detection + pose estimation for each person (top-down)

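Problems: 1) If the person detector fails -> no recovery (when people are close together, the person detector easily misses them).

2) Runtime depends on the number of people: the more people, the longer it takes.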

Bottom-up approaches do not have these two problems.

However, bottom-up approaches do not directly benefit from global information -> the key is to use contextual cues from other body parts and other people.

This paper takes a bottom-up approach, but utilizes global contextual information in the detection of parts and their association.

The paper proposes Part Affinity Fields (PAFs), a set of 2D vector fields. Each 2D vector field encodes the location and orientation of one limb.
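
To make "a 2D vector field that encodes the location and orientation of a limb" more concrete, here is a minimal sketch. It is my own illustration, not code from the paper or from caffe_rtpose: the function name `paf_for_limb`, the `limb_width` threshold, and the rule "unit vector on the limb, zero elsewhere" are all assumptions made for this example.

```python
import math

def paf_for_limb(width, height, joint_a, joint_b, limb_width=1.0):
    """Return a height x width grid of (vx, vy) vectors for one limb.

    Assumption for this sketch: pixels within `limb_width` of the segment
    joint_a -> joint_b hold the unit vector pointing from joint_a to
    joint_b; every other pixel holds (0.0, 0.0).
    """
    ax, ay = joint_a
    bx, by = joint_b
    length = math.hypot(bx - ax, by - ay)
    if length == 0:
        return [[(0.0, 0.0)] * width for _ in range(height)]
    ux, uy = (bx - ax) / length, (by - ay) / length  # limb direction
    field = []
    for y in range(height):
        row = []
        for x in range(width):
            dx, dy = x - ax, y - ay
            along = dx * ux + dy * uy        # position along the limb
            across = abs(dx * uy - dy * ux)  # distance from the limb axis
            on_limb = 0.0 <= along <= length and across <= limb_width
            row.append((ux, uy) if on_limb else (0.0, 0.0))
        field.append(row)
    return field

# Hypothetical horizontal forearm from an elbow at (0, 2) to a wrist at (4, 2).
field = paf_for_limb(5, 5, (0, 2), (4, 2))
print(field[2][2])  # (1.0, 0.0): on the limb, pointing toward the wrist
print(field[0][2])  # (0.0, 0.0): too far from the limb
```

Where the vectors are non-zero tells you the location of the limb, and the vector itself tells you its orientation, so one grid carries both pieces of information.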

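These fields (which capture the connections between parts and their orientation) and the confidence maps for parts (confidence maps of the joints) are jointly learned and predicted through a sequential prediction framework.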

Both the confidence maps for parts and the Part Affinity Fields are 2D spatial grids. They can represent the unstructured, multimodal uncertainty that arises due to occlusion and contact, and they can be analyzed with convolutions.
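
A small sketch of why a 2D grid can express "multimodal" uncertainty: with two people in the image, the map for one part type simply has two peaks. This is only an illustration under my own assumptions (the name `confidence_map`, a Gaussian-shaped peak with spread `sigma`, and merging overlapping peaks with `max`); it is not taken from the released code.

```python
import math

def confidence_map(width, height, joints, sigma=1.0):
    """Build one confidence map for a single part type.

    Assumption for this sketch: every annotated joint contributes a
    Gaussian-shaped peak, and overlapping peaks are merged with max so
    that nearby people keep distinct peaks.
    """
    grid = [[0.0] * width for _ in range(height)]
    for jx, jy in joints:
        for y in range(height):
            for x in range(width):
                d2 = (x - jx) ** 2 + (y - jy) ** 2
                grid[y][x] = max(grid[y][x], math.exp(-d2 / sigma ** 2))
    return grid

# Two hypothetical right-wrist positions, one per person.
cmap = confidence_map(7, 3, [(1, 1), (5, 1)])
print(cmap[1][1], cmap[1][5])  # 1.0 1.0 (one peak per person)
print(cmap[1][3] < 0.1)        # True (low confidence between them)
```

Because the map is just a grid of scores, nothing forces it to commit to a single location, which is what makes it suitable when occlusion or contact leaves several plausible positions.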

-------

I don't fully understand the following sentence:

As the confidence maps and affinity fields encode global context in their prediction, they allow an efficient algorithm that uses greedy association over a minimum spanning tree without significant loss in the quality of pose estimates.

------
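
One possible reading of that sentence, as a sketch: if the skeleton is treated as a tree, each limb type (for example elbow-wrist) can be matched on its own, and within one limb type the candidate pairs can be accepted greedily in order of how well the PAF supports them. Everything below is my assumption for illustration: the names `limb_score` and `greedy_match`, the number of sample points, and the use of an averaged dot product as the score. It should not be taken as the authors' implementation.

```python
import math

def limb_score(paf, joint_a, joint_b, samples=10):
    """Average dot product between the PAF and the a -> b unit direction,
    sampled at evenly spaced points along the segment (samples >= 2)."""
    ax, ay = joint_a
    bx, by = joint_b
    length = math.hypot(bx - ax, by - ay)
    if length == 0:
        return 0.0
    ux, uy = (bx - ax) / length, (by - ay) / length
    total = 0.0
    for i in range(samples):
        t = i / (samples - 1)
        x = int(round(ax + t * (bx - ax)))
        y = int(round(ay + t * (by - ay)))
        vx, vy = paf[y][x]
        total += vx * ux + vy * uy
    return total / samples

def greedy_match(paf, candidates_a, candidates_b):
    """Pair candidates of two part types, best-scoring pairs first.
    Each candidate is used at most once; non-positive scores are skipped."""
    scored = sorted(
        ((limb_score(paf, a, b), i, j)
         for i, a in enumerate(candidates_a)
         for j, b in enumerate(candidates_b)),
        reverse=True,
    )
    used_a, used_b, pairs = set(), set(), []
    for score, i, j in scored:
        if score > 0 and i not in used_a and j not in used_b:
            pairs.append((i, j))
            used_a.add(i)
            used_b.add(j)
    return pairs

# Toy field: two horizontal limbs on rows 1 and 3 of a 5 x 5 grid.
paf = [[(1.0, 0.0) if y in (1, 3) else (0.0, 0.0) for x in range(5)]
       for y in range(5)]
elbows = [(0, 1), (0, 3)]
wrists = [(4, 3), (4, 1)]
print(sorted(greedy_match(paf, elbows, wrists)))  # [(0, 1), (1, 0)]
```

In the toy example each elbow is paired with the wrist on its own row, because the crossing pairs run through pixels where the field is zero and so score lower. Repeating this per limb type along the tree is cheap compared with searching over all part combinations at once.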