参考论文:
Rico Sennrich, Barry Haddow, and Alexandra Birch.2016. Edinburgh neural machine translation systems for wmt 16. arXiv preprint arXiv:1606.02891.
Rico Sennrich, Barry Haddow, and Alexandra Birch. 2016a. Improving Neural Machine Translation Models with Monolingual Data. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL 2016), Berlin, Germany.
还是这个Sennrich,他在WMT16中说:We found that during decoding, the model would occasionally assign a high probability to
words based on the target context alone, ignoring the source sentence.
具体做法是 we experiment with training separate models that produce the target text from right-to-left (r2l), and re-scoring the nbest lists that are produced by the main (left-toright) models with these r2l models. Since the right-to-left model will see a complementary target context at each time step, we expect that the averaged probabilities will be