[paper] Transformer-Related Paper Reading
[paper] Transformer-XL: Attentive Language Models
(venv2.7) mi@mi-OptiPlex-7060:~/shenhao/study/transformer-xl/tf$ bash scripts/enwik8_base_gpu.sh train_data
Producing dataset...
building vocab with ...
Original · 2019-08-20 20:41:48 · 565 views · 0 comments