video recognition
Eudemonia_mia
这个作者很懒,什么都没留下…
展开
-
Two-Stream Convolutional Networks for Action Recognition in Videos [Paper Part]
1.Contributionpropose a two-stream ConvNet architecturespatial & tmporalConvNet trained on multi-frame dense optical flow is able to achieve very good performancemulti-task learncan incr...原创 2018-09-22 14:28:25 · 428 阅读 · 0 评论 -
Temporal Segment Networks:Towards Good Practices for Deep Action Recognition[Paper Part]
1.Aimdiscover the principle to design effective ConvNet architecture for action recognition in videoslearn these models given limited training samples2.ContributionTSNbased on the idea of lon...原创 2018-10-07 10:25:16 · 502 阅读 · 0 评论 -
Two-Stream Convolutional Networks for Action Recognition in Videos[summary part]
算法介绍双流网络使用以单帧RGB作为输入的CNN来处理空间维度的信息,使用以多帧密度光流场作为输入的CNN来处理时间维度的信息,并通过多任务训练的方法将两个行为分类的数据集联合起来(UCF101与HMDB),去除过拟合进而获得更好效果。贡献提出two-stream ConvNet来对时空特征进行建模表示提出了多帧光流作为输入,对性能提升作用很大源码未公开源码光流图像中物体的运...原创 2018-11-11 21:03:11 · 720 阅读 · 0 评论 -
Temporal Segment Networks[Summary part]
算法介绍当时研究的不足(尤其是双流):只能处理短期运动(short-term),对长期运动(long-range)时间结构进行理解不足训练样本较小提出的处理办法:弥补第一个不足:使用稀疏时间采样策略和基于视频监督的策略,将视频进行时域分割后随机抽取片段弥补第二个不足:交叉预训练、正则化技术和数据扩张技术源码公开源代码,基于caffe实现,以及另一种实现方式,基于pytorc...原创 2018-11-12 11:02:39 · 387 阅读 · 0 评论 -
Deep Learning of Action Recognition
总体思路:抽取并分类时空特征为目的的视频识别方法two-stream(双流)方法C3D方法CNN-LSTM方法以提取骨架信息进行再训练为目的的姿态估计方法原创 2018-11-12 13:16:58 · 219 阅读 · 0 评论 -
Deep Temporal Linear EncodingNetwork[Paper & Summary Part]
(1)Present a new video representation, called temporal linear encoding (TLE)(2)Embedded inside of CNNs as a new layer,which captures the appearance and motion throughout entire videos.Encodes this...原创 2018-12-08 20:07:27 · 680 阅读 · 0 评论