行为识别论文笔记|ARTNet|Appearance-and-Relation Networks for Video Classification

Alex丶Chen

于 2020-11-30 08:48:15 发布

阅读量378

点赞数

分类专栏：视频理解行为识别

本文链接：https://blog.csdn.net/njuptalex/article/details/110366317

版权

ARTNet论文笔记介绍了Wang等人提出的一种新型视频分类架构，它结合了外观和关系分支来增强时空表示。相比两流CNN和3D CNN，ARTNet通过SMART块在减少计算消耗的同时提高准确性，尤其是对于局部特征的建模。实验表明，它在Kinetics训练和UCF101、HMDB测试集上表现良好，但可能在时序建模效率上不如3D CNN，并且没有使用残差结构可能导致深层网络时序信息减弱。

摘要由CSDN通过智能技术生成

行为识别论文笔记-ARTNet-Appearance-and-Relation Networks for Video Classification

Wang, Limin, et al. “Appearance-and-relation networks for video classification.” Proceedings of the IEEE conference on computer vision and pattern recognition. 2018.

Motivation

3 kinds of architectures for video classification: (1) two-stream CNNs (time-consuming, optical flow in advance) (2) 3D CNNs (worse than two stream) and (3) 2D CNNs with temporal models on top such as LSTM, temporal convolution, sparse sampling and aggregation, and attention modeling. (worse in local spatiotemporal representation)

multiplicative interactions to model relation between different views: Gated Boltzmann machines, Energy models, Independent Subspace Analysis (ISA)(similar to Energy mod

最低0.47元/天解锁文章

Alex丶Chen

关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
行为识别论文笔记|ARTNet|Appearance-and-Relation Networks for Video Classification

视频理解论文笔记-ARTNet-Appearance-and-Relation Networks for Video ClassificationWang, Limin, et al. “Appearance-and-relation networks for video classification.” Proceedings of the IEEE conference on computer vision and pattern recognition. 2018.Motivation3 k
复制链接

扫一扫

专栏目录