2019.9.5 note
A Structural Probe for Finding Syntax in Word Representations
- The probe identifies a linear transformation under which squared L2 distance encodes the distance between words in the parse tree, and one in which squared L2 norm encodes depth in the parse tree. Using this probe, we show that such transformations exist, providing evidence that entire syntax trees are embedded implicitly in deep models’ vector geometry.
This defines d(x, y) = (f(x) - f(y))^T (f(x) - f(y)) with f(x) = A v_x for BERT embedding v_x, i.e., squared L2 distance under a learned linear map A. The paper finds that this d can learn the distances between words on the parse tree.
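A minimal numpy sketch of this probe distance: `A` would be learned by regressing d(x, y) onto gold parse-tree distances, but here it is random just to illustrate the computation (the embeddings `V` are stand-ins, not real BERT vectors).

```python
import numpy as np

rng = np.random.default_rng(0)
d_model, d_probe, n_words = 8, 4, 5

V = rng.normal(size=(n_words, d_model))  # stand-ins for BERT embeddings v_x
A = rng.normal(size=(d_probe, d_model))  # probe matrix (learned in the paper)

def probe_dist(vx, vy, A):
    """Squared L2 distance after the linear map: (A(vx - vy))^T (A(vx - vy))."""
    diff = A @ (vx - vy)
    return float(diff @ diff)

# Pairwise predicted tree distances for a sentence of n_words tokens.
D = np.array([[probe_dist(V[i], V[j], A) for j in range(n_words)]
              for i in range(n_words)])
```

Since d is a squared distance, D is symmetric, non-negative, and zero on the diagonal; training fits A so that D approximates the parse-tree distance matrix.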