1 Milestone advances in image captioning.
Year | Paper | Abstract |
---|---|---|
2015 | Show, Attend and Tell: Neural Image Caption Generation with Visual Attention | Inspired by recent work in machine translation and object detection, we introduce an attention based model that automatically learns to describe the content of images. |
2017 | Deep Reinforcement Learning-based Image Captioning with Embedding Reward | Most state-of-the-art approaches follow an encoder-decoder framework, which generates captions using a sequential recurrent prediction model. However, in this paper, we introduce a novel decision-making framework for image captioning. We utilize a “policy network” and a “value network” to collaboratively generate captions. |
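
To make the attention idea from the 2015 entry concrete, here is a minimal sketch of one soft-attention step: the decoder scores each image region, normalizes the scores with a softmax, and takes a weighted average of the region features as the context for the next word. The function name `soft_attention`, the toy vectors, and the dot-product scoring are assumptions made for illustration; the paper computes scores with a small learned network, so this is not the authors' implementation.

```python
import math


def soft_attention(region_features, hidden_state):
    """One soft-attention step over image regions (illustrative sketch).

    region_features: list of equal-length feature vectors, one per image region.
    hidden_state: decoder state vector with the same length as each feature.
    Returns (weights, context).
    """
    # Assumption: dot-product relevance score between each region and the state.
    scores = [
        sum(f * h for f, h in zip(feature, hidden_state))
        for feature in region_features
    ]

    # Numerically stable softmax so the weights are positive and sum to 1.
    max_score = max(scores)
    exps = [math.exp(s - max_score) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]

    # Context vector: attention-weighted average of the region features.
    dim = len(region_features[0])
    context = [
        sum(w * feature[d] for w, feature in zip(weights, region_features))
        for d in range(dim)
    ]
    return weights, context


# Toy usage: two regions, and a decoder state that points at the first one.
regions = [[1.0, 0.0], [0.0, 1.0]]
state = [10.0, 0.0]
weights, context = soft_attention(regions, state)
```

In this toy case nearly all of the attention weight lands on the first region, so the context vector is close to that region's features. A full captioning model would feed the context vector into a recurrent decoder at every time step.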