Transformer技术学习(原理+代码)
1. 论文
Attention Is All You Need https://arxiv.org/abs/1706.03762
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context https://arxiv.org/abs/1901.02860
2. Transformer原理
1.【NLP】Transformer详解 https://zhuanlan.zhihu.com/p/44121378
2. 详解Transformer (Attention Is All You Need) https://zhuanlan.zhihu.com/p/48508221
3. 模型详解 https://terrifyzhao.github.io/2019/01/11/Transformer模型详解.html
4. 深度学习:transformer模型