Deep Learning Paper Reading
WXiujie123456
Just a beginner, still learning
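[76] Paper reading: Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations
In this work, the authors propose learning video representations that encode action steps and their temporal ordering from a large-scale dataset of web instructional videos and their narrations, without using human annotations. The model significantly improves on the state of the art in step classification (+2.8% on COIN, +3.3% on EPIC-Kitchens) and step forecasting (+7.4% on COIN). It also achieves strong results in zero-shot inference for step classification and forecasting, and in predicting diverse and plausible steps for incomplete procedures.
Original post · 2023-07-24 16:56:40 · 205 views · 0 comments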
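Paper reading [78]: CVPR 2023, Procedure-Aware Pretraining for Instructional Video Understanding
The authors argue that instructional videos depict sequences of steps that repeat across instances of the same or different tasks, and that this structure is well represented by a Procedural Knowledge Graph (PKG), in which nodes are discrete steps and edges connect steps that occur consecutively in instructional activities. The graph can then be used to generate pseudo-labels for training a video representation that encodes procedural knowledge in a more accessible form, so that it generalizes to multiple procedure understanding tasks.
Original post · 2023-07-12 15:16:56 · 178 views · 0 comments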
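The graph structure described in this summary can be illustrated with a minimal Python sketch. It is not the paper's implementation: the step names, the toy step sequences, and the helper names `build_pkg` and `next_step_pseudo_labels` are all hypothetical, and the "pseudo-label" here is simply the list of steps observed directly after a given step.

```python
from collections import defaultdict


def build_pkg(step_sequences):
    """Build a toy procedural knowledge graph (hypothetical helper).

    Nodes are discrete step names. Each edge (a, b) stores how many
    times step b directly followed step a in some sequence.
    """
    nodes = set()
    edges = defaultdict(int)
    for seq in step_sequences:
        nodes.update(seq)
        # Connect every pair of consecutive steps.
        for prev_step, next_step in zip(seq, seq[1:]):
            edges[(prev_step, next_step)] += 1
    return nodes, dict(edges)


def next_step_pseudo_labels(edges, step):
    """Steps seen directly after `step`, most frequent first (ties by name)."""
    followers = [(b, count) for (a, b), count in edges.items() if a == step]
    followers.sort(key=lambda item: (-item[1], item[0]))
    return [b for b, _ in followers]


# Toy step sequences standing in for steps mined from videos (made up).
videos = [
    ["boil water", "add pasta", "drain"],
    ["boil water", "add salt", "add pasta", "drain"],
    ["boil water", "add pasta", "add sauce"],
]

nodes, edges = build_pkg(videos)
print(sorted(nodes))
print(next_step_pseudo_labels(edges, "boil water"))
```

Storing edge counts rather than plain adjacency is a design choice made for this sketch so that candidate next steps can be ranked; the summary above only states that edges connect consecutive steps.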
Paper reading: Forecasting Human-Object Interaction: Joint Prediction of Motor Attention and Actions in First Person Video
ECCV 2020. Task: anticipating human-object interaction in first-person videos.
Original post · 2022-04-03 12:37:54 · 245 views · 0 comments
Paper reading: Colar: Effective and Efficient Online Action Detection by Consulting Exemplars
CVPR 2022. Task: online action detection.
Original post · 2022-04-03 12:27:33 · 1091 views · 0 comments
Paper reading: End-to-End Semi-Supervised Learning for Video Action Detection
Reading notes. CVPR 2022. Task: an end-to-end semi-supervised method for video action detection.
Original post · 2022-04-03 12:15:11 · 1284 views · 0 comments
Paper reading: X3D: Expanding Architectures for Efficient Video Recognition
Paper share. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020. Task: a video recognition method that expands 2D architectures into 3D.
Original post · 2022-04-03 11:42:43 · 402 views · 0 comments
Paper reading: Video Transformer Network
ICCV 2021. Task: VTN, a Transformer-based video recognition framework.
Original post · 2022-04-02 21:55:37 · 764 views · 0 comments
Paper reading: Skeleton-based abnormal gait recognition with spatio-temporal attention enhanced gait-structural graph convolutional networks
Neurocomputing 2022. Task: gait recognition based on skeleton features.
Original post · 2022-04-02 21:46:16 · 1065 views · 2 comments
Paper reading: Aggregating Long-Term Context for Learning Laparoscopic and Robot-Assisted Surgical Workflows
A paper share on recognizing workflows in long surgical procedures.
Original post · 2022-04-01 10:32:04 · 2973 views · 0 comments
Paper reading: Intention Recognition of Pedestrians and Cyclists by 2D Pose Estimation
A paper share on understanding the road-crossing intentions of pedestrians and cyclists.
Original post · 2022-04-01 10:18:44 · 1547 views · 0 comments
Paper reading: Rethinking Anticipation Tasks: Uncertainty-aware Anticipation of Sparse Surgical Instrument
A paper share in the area of surgical tool recognition.
Original post · 2022-04-01 10:04:25 · 165 views · 0 comments
Paper reading: Fast User-Guided Video Object Segmentation by Interaction-and-Propagation Networks
Paper reading notes.
Original post · 2022-04-01 09:41:40 · 362 views · 0 comments
Paper reading: Learning Motion in Feature Space: Locally-Consistent Deformable Convolution Networks for Fine-Grained Action Detection
Reading notes on the paper.
Original post · 2022-03-31 11:17:36 · 403 views · 0 comments
Paper reading: Learning Transferable Visual Models From Natural Language Supervision
Organized paper reading notes.
Original post · 2022-04-01 09:15:56 · 437 views · 2 comments
Paper reading: TSM: Temporal Shift Module for Efficient Video Understanding
Personal reading notes.
Original post · 2022-03-31 16:14:06 · 426 views · 0 comments
Paper reading: ActionCLIP: A New Paradigm for Video Action Recognition
Personal notes from reading the paper.
Original post · 2022-03-31 15:55:42 · 3668 views · 1 comment
Paper reading: RSDNet: Learning to Predict Remaining Surgery Duration from Laparoscopic Videos Without Manual
A personal write-up of the paper.
Original post · 2022-03-31 15:40:37 · 169 views · 0 comments
Paper reading: SV-RCNet: Workflow Recognition From Surgical Videos Using Recurrent Convolutional Network
A paper reading share.
Original post · 2022-03-31 11:32:57 · 1033 views · 1 comment
Paper reading: MS-TCN++: Multi-Stage Temporal Convolutional Network for Action Segmentation
Reading notes on the paper.
Original post · 2022-03-31 10:39:41 · 494 views · 0 comments
Paper reading: Hybrid Recurrent Neural Network Architecture-Based Intention Recognition for Human-Robot Collaboration
Reading notes on the paper.
Original post · 2022-03-31 10:29:49 · 230 views · 0 comments
Paper reading: Towards Unified Surgical Skill Assessment
A paper reading share.
Original post · 2022-03-31 10:15:15 · 240 views · 0 comments
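Roundup of deep learning papers I have read (continuously updated)
This post records some of the papers I read while studying deep learning. Most of it reflects my own understanding of the papers and is for reference only.
Original post · 2022-03-31 09:58:49 · 4542 views · 1 comment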
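Literature reading: Long-Term Temporal Convolutions (LTC) for Action Recognition
The authors use neural networks with long-term temporal convolutions (LTC) to learn video representations. They show that LTC-CNN models with increased temporal extents improve action recognition accuracy. They also study the effect of different low-level representations, such as raw video pixel values and optical flow vector fields, and demonstrate that high-quality optical flow estimation is important for learning accurate action models.
Original post · 2022-03-31 09:45:07 · 1154 views · 0 comments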