Paper Reading Notes
Average article quality score: 94
Isangelaa
[Paper Reading] Iterative Answer Prediction with Pointer-Augmented Multimodal Transformers for TextVQA
Sections: motivation. Original · 2022-03-25 16:48:50 · 2980 views · 0 comments
[Paper Reading] Deep Compositional Captioning: Describing Novel Object Categories without Paired Training Data
Sections: motivation, task, difficult, contribution, method, experiment, datasets, image description, video description, metrics, Results, visualization, conclusion, Related Work, Deep Captioning, Zero-Shot. Original · 2022-03-01 15:07:54 · 272 views · 0 comments
[Paper Reading] TextCaps: a Dataset for Image Captioning with Reading Comprehension
Sections: motivation, task, difficulty, challenges, contribution, related works, Image Captioning, Optical Character Recognition (OCR), Visual Question Answering with Text Reading Ability, TextCaps Dataset, dataset construction, … Original · 2022-02-28 12:12:24 · 2776 views · 0 comments