【ECCV 2024】InternVideo2: Scaling Foundation Models for Multimodal Video Understanding
一、前言
Authors: Yi Wang, Kunchang Li, Xinhao Li, Jiashuo Yu, Yinan He, Chenting Wang, Guo Chen, Baoqi Pei, Ziang Yan, Rongkun Zheng, Jilan Xu, Zun Wang, Yansong Shi, Tianxiang Jiang, Songze Li, Hongjie Zhang, Yifei Huang, Yu Qiao, Yali Wang, Limin Wang
单位:OpenGVLab, Shanghai AI Laboratory
Abstract
介绍:
我们推出了 InternVideo2,这是一个新的视频基础模型 (ViFM)