行为识别论文笔记-I3D T3D S3D R(2+1)D P3D CSN
I3D
Carreira, Joao, and Andrew Zisserman. “Quo vadis, action recognition? a new model and the kinetics dataset.” proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2017.
T3D
Diba, Ali, et al. “Temporal 3d convnets: New architecture and transfer learning for video classification.” arXiv preprint arXiv:1711.08200 (2017).
S3D
Xie, Saining, et al. “Rethinking spatiotemporal feature learning: Speed-accuracy trade-offs in video classification.” Proceedings of the European Conference on Computer Vision (ECCV). 2018.
R(2+1)D
Tran, Du, et al. “A closer look at spatiotemporal convolutions for action recognition.” Proceedings of the IEEE conference on Computer Vision and Pattern Recognition. 2018.
P3D
Qiu, Zhaofan, Ting Yao, and Tao Mei. “Learn