Video-Text Retrieval Paper Roundup
Video-Text Retrieval:
2020 CVPR, ViT: An Image Is Worth 16x16 Words: Transformers for Image Recognition at Scale
2021 CVPR, Dual Encoder: Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval
2021 Elsevier, MAN: Multi...










