Attention Is All You Need: Reading and Translation
Abstract
The dominant sequence transduction models are based on complex recurrent or convolutional neural networks that include an encoder and a decoder. The best performing models also connect the encoder and decoder through an attention mechanism...
Original post · 2020-06-14 17:26:02