Attention is all you need (一)

最新推荐文章于 2024-07-10 22:17:05 发布

W&J

最新推荐文章于 2024-07-10 22:17:05 发布

阅读量203

点赞数

分类专栏： NLP论文文章标签：深度学习机器学习自然语言处理

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.csdn.net/hangzuxi8764/article/details/126764917

版权

NLP论文专栏收录该内容

2 篇文章 0 订阅

订阅专栏

论文地址：https://arxiv.org/abs/1706.03762

1、本篇论文提出的模型是 Transformer。

2、适用的任务是 sequence modeling(例语言模型) 和 sequence transduction(例机器翻译)。

3、目前主流的方法是用基于RNN的或CNN的encoder-decoder结构，在encoder和decoder中间用attention机制做连接。

4、Transformer 解决的问题是，减少计算时间复杂度，加快训练速度，提升模型效果。

5、Transformer 解决的方法是，用attention替代encoder和decoder中的RNN结构，Transformer中只有attention。

本篇读

浅看一下

3 Model Architecture

0 Abastract

目前机器翻译的主流模型用的是基于RNN或CNN的encoder-decoder模型，encoder和decoder中间用attention进行连接会取得更好的效果。

本篇论文提出的Transformer，仅仅有 attention mechanisms的网络结构，不依赖与RNN 和 CNN的网络结构。

实验效果：效果更好，更加并行化，训练时间少。

7 Conclusion

1、本篇论文首次提出一个完全基于attention的sequence transduction模型，称为Transformer。用multi-headed self-attention替代encoder-decoder结构中常见的RNN。

2、在翻译的任务上，Transformer的训练速度明显快于基于RNN和CNN的结构。

3、未来展望：计划将Transformer应用在其他任务上；将Transformer扩展到输入输出为图像、音频、视频的任务上（这盛世如你所愿！）

4、代码地址：https://github.com/tensorflow/tensor2tensor

(用pytorch的推荐：GitHub - jadore801120/attention-is-all-you-need-pytorch: A PyTorch implementation of the Transformer model in "Attention is All You Need".）

3 Model Architecture

1、模型结构图

2、attention、RNN、CNN 的时间复杂度对比

6 Results

三个实验数据：翻译、调参、英语成份句法分析

1、Machine Translation

在EN-DE，EN-FR的翻译任务上，对比下列模型的BLEU指标和训练成本。

2、Model Variations

为评估Transformer的不同组成部分的重要性，改变attention相关的几个参数：muti-heads中N的个数，self-attention中key 和 value的维度。

3、English constituency Parsing

为评估Transformer是否可以用于其他任务，用英语成份句法分析做了实验对比。

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
Attention is all you need (一)

本篇是读Transformer模型的论文《attention is all you need》的第一个部分，读摘要、结论，浅看模型结构图和实验对比表
复制链接

扫一扫

专栏目录

博客等级

码龄8年

19
原创

128
点赞

534
收藏

25
粉丝

关注

私信

热门文章

分类专栏

最新评论

无法连接NVIDIA驱动：NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver
weixin_44700433: Module nvidia/520.61.05 already installed on kernel 5.15.0-105-generic/x86_64
无法连接NVIDIA驱动：NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver
夜半推窗雨: NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.这种情况可能是没有禁用系统中自带的nouveau
无法连接NVIDIA驱动：NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver
小怪胖头鱼: Kernel preparation unnecessary for this kernel. Skipping... Building module: cleaning build area... 'make' -j32 NV_EXCLUDE_BUILD_MODULES='' KERNEL_UNAME=6.5.0-15-generic modules.....(bad exit status: 2) ERROR (dkms apport): binary package for nvidia: 535.54.03 not found Error! Bad return status for module build on kernel: 6.5.0-15-generic (x86_64) Consult /var/lib/dkms/nvidia/535.54.03/build/make.log for more information. 请问大佬们有遇到这个问题的吗怎么解决的呀
无法连接NVIDIA驱动：NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver
努力的BigJiang: cd /usr/src找不到版本号杂办
无法连接NVIDIA驱动：NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver
m0_56970585: 爱死23.09

您愿意向朋友推荐“博客详情页”吗？

强烈不推荐
不推荐
一般般
推荐
强烈推荐

提交

最新文章

目录

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。