Sentence Vectors

  1. Sentence vectors from word vectors

1) Bag of words: take the unweighted average of the word vectors (sketched below together with 2)
2) TF-IDF weighted average of the word vectors
3) SIF (smooth inverse frequency) weighted average
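A minimal sketch of methods 1) and 2), assuming pretrained word vectors are available as a dict from token to NumPy array (`word_vecs` and the 300-dimensional size are illustrative); the TF-IDF weights come from scikit-learn's TfidfVectorizer:

```python
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer

def mean_pooling(tokens, word_vecs, dim=300):
    """1) Bag of words: unweighted average of the word vectors."""
    vecs = [word_vecs[t] for t in tokens if t in word_vecs]
    return np.mean(vecs, axis=0) if vecs else np.zeros(dim)

def tfidf_pooling(sentences, word_vecs, dim=300):
    """2) TF-IDF weighted average over whitespace-tokenized sentences."""
    tfidf = TfidfVectorizer(tokenizer=str.split, lowercase=False)
    weights = tfidf.fit_transform(sentences)        # sparse (n_sentences, vocab_size)
    vocab = tfidf.get_feature_names_out()
    out = np.zeros((len(sentences), dim))
    for i in range(len(sentences)):
        row = weights[i]
        num, den = np.zeros(dim), 0.0
        for j in row.nonzero()[1]:                  # nonzero TF-IDF entries only
            w = vocab[j]
            if w in word_vecs:
                num += row[0, j] * word_vecs[w]
                den += row[0, j]
        out[i] = num / den if den > 0 else num
    return out
```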
That is, the MLE is approximately a weighted average of the vectors of the words in the sentence. Note that for more frequent words w, the weight a/(p(w) + a) is smaller, so this naturally leads to a down-weighting of the frequent words.
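In formula form (following the SIF paper, Arora et al. 2017; $a$ is a small smoothing constant, typically on the order of $10^{-3}$, and $p(w)$ is the unigram probability of word $w$), the MLE is approximately

$$\tilde{c}_s = \frac{1}{|s|} \sum_{w \in s} \frac{a}{a + p(w)}\, v_w$$

where $v_w$ is the word vector of $w$.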
To estimate $c_s$, we estimate the direction $c_0$ by computing the first principal component of the $\tilde{c}_s$'s for a set of sentences. In other words, the final sentence embedding is obtained by subtracting the projection of each $\tilde{c}_s$ onto their first principal component.
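Writing $u$ for that first principal component (the common direction $c_0$), the correction amounts to

$$v_s = \tilde{c}_s - u u^{\top} \tilde{c}_s$$

i.e., each weighted average has its projection onto $u$ subtracted.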
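Putting both steps together, a minimal SIF sketch (`word_vecs` and `word_prob` are assumed inputs: pretrained word vectors and unigram probabilities; a = 1e-3 follows the paper's default):

```python
import numpy as np

def sif_embeddings(sentences, word_vecs, word_prob, dim=300, a=1e-3):
    """SIF: frequency-weighted average, then remove the first principal component."""
    emb = np.zeros((len(sentences), dim))
    for i, tokens in enumerate(sentences):           # each sentence is a list of tokens
        vecs = [a / (a + word_prob.get(t, 0.0)) * word_vecs[t]
                for t in tokens if t in word_vecs]
        if vecs:
            emb[i] = np.mean(vecs, axis=0)
    # First principal component of the sentence matrix (the common direction c0).
    u = np.linalg.svd(emb, full_matrices=False)[2][0]
    return emb - emb @ np.outer(u, u)                # subtract the projection onto c0
```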

  2. Obtaining sentence vectors directly

1) Encoder: run an RNN/LSTM over the sentence and take the hidden vector at the end of the sequence; with a two-layer model, concatenate the two resulting hidden vectors.
RNNs using long short-term memory (LSTM) capture long-distance dependencies and have also been used for modeling sentences (Tai et al., 2015).
2) BERT: the output at the [CLS] position is taken as the sentence vector (see the sketch after this list).
3) Skip-thought vectors: skip-thought (Kiros et al., 2015) tries to reconstruct the surrounding sentences from the encoded one and treats the resulting hidden states as the sentence representations.
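A minimal sketch of method 2), using the Hugging Face transformers library (the checkpoint name `bert-base-uncased` is only an example, and whether the raw [CLS] output is a good sentence vector without task-specific fine-tuning is debatable):

```python
import torch
from transformers import BertModel, BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertModel.from_pretrained("bert-base-uncased")
model.eval()

sentences = ["The cat sat on the mat.", "Sentence embeddings are useful."]
batch = tokenizer(sentences, padding=True, truncation=True, return_tensors="pt")

with torch.no_grad():
    outputs = model(**batch)

# The hidden state at position 0 is the [CLS] token's output.
cls_embeddings = outputs.last_hidden_state[:, 0, :]   # shape: (batch, hidden_size)
print(cls_embeddings.shape)                           # torch.Size([2, 768])
```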
