【视频理解】最近几年视频分类技术综述
视频分类是一个难点,本文将介绍从论文的背景问题、核心思想、具体方案三个角度,阅读下面四篇文章。下面四篇文章主要考虑借助强化学习的方法,解决视频分类。
- Watching a small portion could be as good as watching all Towards efficient video classification(2018 IJCAI)
- AdaFrame: Adaptive Frame Selection for Fast Video Recognition(2019 CVPR)
- Multi-Agent Reinforcement Learning Based Frame Sampling for Effective Untrimmed Video Recognition(2019 ICCV)
- Dynamic Sampling Networks for Efficient ActionRecognition in Videos (2020 TIP)
1. Watching a small portion could be as good as watching all Towards efficient video classification(2018 IJCAI)
1.1 背景问题
视频是由一系列的帧构成,一般方法都是对全部视频帧使用CNN来提取特征,然后使用LSTM进行时间上的建模,最终输出视频的类别。但是提取全部视频帧的特征效率底下。基于此有人提出等间隔采样算法,但算法效率仍然较差。