视觉导航(二)VISUAL TRANSFORMER NETWORK FOR OBJECT GOAL NAVIGATION VTNet:VISUAL TRANSFORMER NETWORK FOR OBJECT GOAL NAVIGATION
论文阅读笔记(二):Bridging Video-text Retrieval with Multiple Choice Questions Bridging Video-text Retrieval with Multiple Choice Questions
论文阅读笔记(一):Reading-strategy Inspired Visual Representation Learning for Text-to-Video Retrieval 视频文本检索(video-text-retrieval)、transformer处理视频