BiDAF
Char-CNN: character-level convolutional neural network for word embedding. It convolves over local windows of character embeddings within each word to generate the next-stage word representation. Multiple filters can be used to represent different mapping relations in different sub-spaces, i.e. different structural information.
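A minimal sketch of this idea (all sizes and the random weights are made up for illustration; a real model learns them): filters slide over character windows of a word and a max-over-time pool yields one fixed-size vector per word.

```python
import numpy as np

rng = np.random.default_rng(0)

char_vocab, char_dim = 50, 8     # assumed character vocabulary / char embedding size
n_filters, window = 16, 3        # assumed number of filters / filter width

char_emb = rng.normal(size=(char_vocab, char_dim))
filters  = rng.normal(size=(n_filters, window * char_dim))   # one row per filter

def charcnn_word_embedding(char_ids):
    """Map one word (a list of character ids) to an n_filters-dim vector."""
    x = char_emb[char_ids]                          # (word_len, char_dim)
    windows = [x[i:i + window].reshape(-1)          # sliding windows of characters
               for i in range(len(char_ids) - window + 1)]
    feats = np.stack(windows) @ filters.T           # (n_windows, n_filters)
    return np.maximum(feats, 0).max(axis=0)         # ReLU + max-over-time pooling

word = [3, 17, 9, 24, 5]                            # hypothetical char ids of one word
print(charcnn_word_embedding(word).shape)           # (16,)
```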
GloVe: pre-trained word embeddings, which emphasize linear relations between word embedding vectors (e.g., analogies expressed as vector differences).
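The "linear relation" property can be illustrated with the classic analogy-by-arithmetic trick; the 4-d vectors below are toy values invented for the example, real GloVe vectors are 50-300 dimensional and loaded from the released files.

```python
import numpy as np

emb = {
    "king":  np.array([0.8, 0.6, 0.1, 0.9]),
    "queen": np.array([0.8, 0.6, 0.9, 0.1]),
    "man":   np.array([0.2, 0.1, 0.1, 0.9]),
    "woman": np.array([0.2, 0.1, 0.9, 0.1]),
}

def cosine(a, b):
    return a @ b / (np.linalg.norm(a) * np.linalg.norm(b))

# vec(king) - vec(man) + vec(woman) should be closest to vec(queen)
target = emb["king"] - emb["man"] + emb["woman"]
best = max((w for w in emb if w not in {"king", "man", "woman"}),
           key=lambda w: cosine(emb[w], target))
print(best)  # queen
```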
Context2Query: attention that represents each context word as a weighted sum of query word embeddings, with weights given by that context word's attention distribution over the query words.
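A small sketch of that step, with made-up shapes and a plain dot-product similarity standing in for BiDAF's trainable similarity function:

```python
import numpy as np

rng = np.random.default_rng(0)
T, J, d = 5, 3, 4                    # context length, query length, hidden size (assumed)
H = rng.normal(size=(T, d))          # context word representations
U = rng.normal(size=(J, d))          # query word representations

S = H @ U.T                                              # affinity scores s(t, j), (T, J)
A = np.exp(S) / np.exp(S).sum(axis=1, keepdims=True)     # row-wise softmax over query words
U_tilde = A @ U                      # weighted sum of query embeddings, one per context word
print(U_tilde.shape)                 # (5, 4)
```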
Query2Context: for each context word t in [1, T], select j_t in [1, J] with the maximum affinity score (attention weight) s(t, j_t); the corresponding query word embedding is u_{j_t}. Let maxU = [u_{j_1}, u_{j_2}, ..., u_{j_T}] and S = [s(1, j_1), s(2, j_2), ..., s(T, j_T)]; then g is the weighted sum
g = sum_over_t { s(t, j_t) * u_{j_t} }. g is broadcast to all LSTM time steps in the modeling layer as input (see the sketch below).
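A sketch of this step as described in the note, with hypothetical shapes and random scores. (The BiDAF paper itself forms the attended vector from softmax-normalized context vectors; the code below follows the note's formulation with query embeddings and raw max scores.)

```python
import numpy as np

rng = np.random.default_rng(0)
T, J, d = 5, 3, 4                    # context length, query length, hidden size (assumed)
U = rng.normal(size=(J, d))          # query word embeddings u_1 .. u_J
S = rng.normal(size=(T, J))          # affinity scores s(t, j)

j_max = S.argmax(axis=1)             # j_t = argmax_j s(t, j) for each context word
maxU  = U[j_max]                     # [u_{j_1}, ..., u_{j_T}], shape (T, d)
w     = S[np.arange(T), j_max]       # [s(1, j_1), ..., s(T, j_T)]

g = (w[:, None] * maxU).sum(axis=0)  # g = sum_t s(t, j_t) * u_{j_t}, shape (d,)
G = np.tile(g, (T, 1))               # g broadcast to every modeling-layer time step
print(G.shape)                       # (5, 4)
```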