Paper Reading: Neural Machine Translation by Jointly Learning to Align and Translate

最新推荐文章于 2024-03-20 13:53:30 发布

weixin_30651273

最新推荐文章于 2024-03-20 13:53:30 发布

阅读量96

点赞数

文章标签：人工智能

原文链接：http://www.cnblogs.com/naniJser/p/8900720.html

版权

这篇文章是论文"NEURAL MACHINE TRANSLATION BY JOINTLY LEARNING TO ALIGN AND TRANSLATE"的阅读笔记，这是2015年发表在ICLR的一篇文章。

ABSTRACT

NMT(neural machine translation)是个很多人研究过的问题，最近也突破很多。
回到这篇论文，当时解决NMT问题的做法主要是基于encoder-decoder框架的,这框架也挺好的，在很多领域表现都不错。但是，encoder部分把输入信息压缩到一个固定长度的vector中，这造成了性能的瓶颈。这篇论文提出的模型就是在翻译的过程中自动在输入中寻找与输出目标有关系的部分帮助决策。这就是这篇论文提出的方法的核心思想。

看一下原文是怎么说的?

In this paper, we conjecture that the use of a fixed-length vector is a bottleneck in improving the performance of this basic encoder–decoder architecture, and propose to extend this by allowing a model to automatically (soft-)search for parts of a source sentence that are relevant to predicting a target word, without having to form these parts as a hard segment explicitly.