解读flow-guided feature aggregation for video object detection

文章主要贡献点:

Flow-guided feature aggregation, an end-to-end framework for video object detection.

Improve the per-frame features by aggregation of nearby features along the motion path, and thus improve the video recognition accuracy.

  Or improve the per-frame feature learning by temporal aggregation

数据库ImageNet VID dataset

              3862 video snippet from the traning set

              555 snippets from the validation set

              Fully annotated

              30 object categories (a subset of the categories in the ImageNet DET dataset),

相关工作:


本文工作:

      1. the feature extraction network is applied on individual frames to produce the per-frame feature maps

       2. To enhance the features at a reference frame, an optical flow network  [flownet] estimates themotions between the nearby frames an the reference frame

       3. The feature maps from nearby frames are warped to the reference maps, as well as its own feature maps on the reference frame, areaggregated according to an        adaptive weighting network. 

        4. The resulting aggregated feature maps are then fed to the detection network to produce the detection result on the reference frame.

         System: Feature extraction + flow estimation + feature aggregation + detection







评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值