VQA系列论文(二) 论文阅读:《Multimodal Graph Networks for Compositional Generalization in Visual Question Answering》
VQA系列论文(一) 阅读论文:《MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering》