[Paper note] Video-based Person Re-identification with Accumulative Motion Context

Highlight

  • Two stream: spatial + temporal (optical flow).
  • Use a motion network pre-trained on optical flow to predict OF and also learn end-to-end in training phase.
  • Fusion of motion and spatial features
  • Multiloss: siamese reid and classification loss.

Model

  • Structure of the whole model:
    model
  • Structure of motion network (pre-trained on LK or Epic optical flow):
    motion network
  • Structure of spatial network:
    spatial network
  • Different spatial fusion method: concatenate, sum, max
  • Different spatial fusion position: @ any layer in spatial network
  • Motion context accumulation: via RNN (not LSTM in this paper)
  • Multiloss: siamese (distance) loss + classification (softmax)
  • Pre-train motion network on optical flow: smoothed L-1 loss (l=1,2,3 representing optical flow estimation with different resolutions)
    • L(l)(motion)(
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值