《A Through Examination of the CNN_Daily Mail Reading Comprehension Task》——Stanford Attentive Reader

最新推荐文章于 2022-08-27 18:02:22 发布

MoonLer

最新推荐文章于 2022-08-27 18:02:22 发布

阅读量289

点赞数

分类专栏： NLP deeplearning

本文链接：https://blog.csdn.net/qq_40240102/article/details/102805344

版权

53 篇文章 6 订阅

订阅专栏

32 篇文章 1 订阅

订阅专栏

序

在这里插入图片描述

模型分三部分：
第一部分，编码：问题的词编码一样，先通过一个embedding表，把词编程embedding，然后过双向GRU，前向和后向连在一起表示这个token出的表示，同样对问题也编码，只说了问题编码后的维度：h,估计和其他论文一样，都是前向后向的最后一个concat到一起。

在这里插入图片描述

在这里插入图片描述

和attentive reader对比，这里直接用o去预测了，没有像attentive reader一样再加上question 的embedding q，并且表现也不差。

这个模型最后预测时不用整个词库，只用了entity的词库。
最搞笑的是：加粗那一句，他们说只有第一个是最重要的，其他都是为了简化模型，所以模型核心就是换了一个attention 匹配函数，和张俊林大佬说的一样。
The original model considers all the words from the vocabulary V in making predictions. We think this is unnecessary, and only predict among entities which appear in the passage. Of these changes, only the first seems important; the other two just aim at keeping the model simple.

关注