Relation Network for Person Re-identiﬁcation阅读总结

最新推荐文章于 2023-03-19 23:27:57 发布

MindAndHand

最新推荐文章于 2023-03-19 23:27:57 发布

阅读量806

点赞数 1

文章标签： reid 阅读笔记分块块间关系

本文链接：https://blog.csdn.net/qq_35226955/article/details/103440021

版权

Relation Network for Person Re-identiﬁcation阅读笔记

What？

直接PCB太暴力了，没有考虑到块与块之间的关系。于是本文提出了一种one-vs-rest relational 策略考虑了块与块之间的关系。具体如下：
在这里插入图片描述
以 $p_1$ 为例，上图 $p_1$ ~ $p_6$ 的获取方式和PCB完全一致，后面则略有不同。

这里将 $p_2$ ~ $p_6$ 的结果直接做 $r_1=(p_2+p_3+p_4+p_5+p_6)/5$ ，然后 $\times 1$ 卷积变换通道得到 $\bar r_1$ ，同时 $p_1$ 通过 $\times 1$ 卷积变换通道得到 $\bar p_1$ ，两者按channel做concat得到结果经过 $\times 1$ 卷积变换通道，所得结果与 $\bar p_1$ 做残差加法，得到最终结果 $q_1$ 。然后就可以说 $q_1$ 中包含了与 $p_2$ _{$p_6$有关的信息了，就考率了块与块之间的联系。其余同理，就可以得到$q_1$} $q_6$ 。

公式表达如下：
在这里插入图片描述
其中T表示concat。

然后就是作者提了一个GCP，和以往有啥差别呢？下图直接对比：
在这里插入图片描述

详细描述：
GAP , GMP , GAP+GMP都用过，各有好处，也各有缺陷。

GAP covers the whole body parts of the person image , but it is easily distracted by background clutter and occlusion.
GMP overcomes this problem by aggregating the feature from the most discriminative part useful for reID while discarding background clutter. This, however, does not contain information from the whole body parts.(背景区域基本不利于分类，因此激活值一般很小，通过GMP就自然被drop掉了)`
GAP +GMP may perform better, but it is also inﬂuenced by background clutter. It has been proven that GMP is more effective than GAP(Fu et al. 2019 SSG), which will be also veriﬁed once more in our experiment.
Motivated by this, we propose a novel GCP method based on GMP to extract a global feature map from the whole body parts . 具体咋做，如下图：

做法应该很清楚，这里不再赘述。和GMP，GAP差别也很明显，GCP引入了要学习的参数。

讲到这里，其实很懵，GCP是什么？要GCP干啥的？

GCP指的是Global Contrastive Pool。由于我们之前考虑了块之间的关系，而Contrastive 体现在哪？就是表现在 $p_{avg} - p_{max}$ 。 avg中保留了整个图像的信息，max是行人部分的信息，那差是什么？就是背景部分的信息。而结果和max的行人信息再合并。那去掉又合并岂不是白做了？不是的，concat(合并)之前还有一个conv的存在，因此其实还是不一样的，并不是减去又加上的操作，而是关注了一些更关键的信息。最后同样用一个残差保证学习的结果不会比之前差。