ReID Notes 2: Spatial and Temporal Mutual Promotion for Video-based Person Re-identification

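Key ideas

(1) Refining Recurrent Unit (RRU): handles occlusion, appearance noise, and motion information
(2) Spatial-temporal Clues Integration Module (STIM): integrates spatial and temporal information
(3) Multi-level Training Objective: strengthens the capability of the two modules above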

Summary

The network is fairly complex; the authors really went all out on the design.
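(1) RRU can be viewed as a CNN turned into an RNN: several weight-shared CNNs stacked together. Before a frame enters the RRU, it first passes through a CNN (Inception in the paper). The difference between the current frame's Inception feature and the previous frame's Inception feature represents motion. At the same time, the difference between the current frame's Inception feature and the previous frame's RRU feature represents the change in appearance. These two differences are concatenated and fed into the update gate g of the current frame's RRU, which is used to produce the current frame's RRU feature. The update gate g is defined as follows:
first a transition layer (conv + BN + ReLU), then two branches, a spatial attention model and a channel attention model. Both branches are fairly standard; their features are multiplied together to give the output.
The output of g is then used to refine and fuse the previous RRU feature and the CNN feature (Eq. 5 in the paper), giving the final RRU feature of the current frame.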
(2) STIM: this module is fairly simple, mainly two 3D convolution blocks followed by a global average pooling layer.
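(3) Multi-level Training Objective: the sum of three losses, namely cross entropy loss, batch hard triplet loss, and a part-level ranking constraint built on the second loss. On the third term: the RRU feature is split horizontally into H parts, each part is averaged, and Eq. 10 of the paper is applied.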
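The horizontal split and the batch hard triplet loss in item (3) can be sketched roughly as below. This is a minimal pure-Python illustration, not the paper's code: the function names `part_features` and `batch_hard_triplet_loss`, the nested-list feature layout (height x width x channels), and the margin of 0.3 are all assumptions made here. It does not reproduce Eq. 10; feeding the per-part vectors into the same triplet loss is only one plausible reading of "built on the second loss".

```python
import math


def part_features(feature_map, num_parts):
    """Split an H x W x C feature map into horizontal stripes and average each.

    feature_map is a nested list indexed as [row][column][channel].
    Returns num_parts vectors, each of length C.
    """
    height = len(feature_map)
    if height % num_parts != 0:
        raise ValueError("height must be divisible by num_parts")
    rows_per_part = height // num_parts
    parts = []
    for p in range(num_parts):
        rows = feature_map[p * rows_per_part:(p + 1) * rows_per_part]
        cells = [cell for row in rows for cell in row]  # every spatial position
        channels = len(cells[0])
        parts.append(
            [sum(cell[k] for cell in cells) / len(cells) for k in range(channels)]
        )
    return parts


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def batch_hard_triplet_loss(embeddings, labels, margin=0.3):
    """For each anchor, use the farthest positive and the closest negative."""
    losses = []
    for i, (anchor, label) in enumerate(zip(embeddings, labels)):
        pos = [euclidean(anchor, e)
               for j, (e, l) in enumerate(zip(embeddings, labels))
               if l == label and j != i]
        neg = [euclidean(anchor, e)
               for e, l in zip(embeddings, labels) if l != label]
        if not pos or not neg:
            continue  # anchor has no valid triplet in this batch
        losses.append(max(0.0, margin + max(pos) - min(neg)))
    return sum(losses) / len(losses) if losses else 0.0
```

Under this reading, the part-level term would call `batch_hard_triplet_loss` once per stripe index, using the vectors returned by `part_features` for every sample in the batch.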
The idea is good, but the design is somewhat complex.
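To make the data flow of the RRU in item (1) more concrete, here is a toy sketch on flat feature vectors. It relies on heavy assumptions: a single linear map plus sigmoid stands in for the transition layer and the two attention branches, and the final blend is a GRU-style convex combination, which may well differ from Eq. 5 in the paper. The name `rru_step` and its parameters are hypothetical.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def rru_step(x_t, x_prev, s_prev, gate_weights, gate_bias):
    """One toy recurrent step.

    x_t, x_prev: CNN features of the current and previous frame.
    s_prev: RRU feature of the previous frame.
    gate_weights: one weight row (length 2 * len(x_t)) per output channel.
    """
    # Current CNN feature minus previous CNN feature: motion cue.
    motion_diff = [a - b for a, b in zip(x_t, x_prev)]
    # Current CNN feature minus previous RRU feature: appearance cue.
    appearance_diff = [a - b for a, b in zip(x_t, s_prev)]
    # Concatenate both cues as the gate input.
    gate_input = motion_diff + appearance_diff
    # Assumption: a linear map + sigmoid replaces conv + BN + ReLU + attention.
    gate = [
        sigmoid(sum(w * v for w, v in zip(row, gate_input)) + b)
        for row, b in zip(gate_weights, gate_bias)
    ]
    # Assumption: convex blend of the previous RRU feature and the CNN feature.
    s_t = [g * s + (1.0 - g) * x for g, s, x in zip(gate, s_prev, x_t)]
    return s_t, gate
```

With all-zero weights and biases the gate is 0.5 everywhere, so the new RRU feature is simply the midpoint of `s_prev` and `x_t`; a gate near 1 keeps the previous RRU feature, which is how an occluded or noisy frame could be suppressed in this simplified form.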

