Deep Learning Fundamentals Review 2: The RNN Family

Latest recommended article published on 2022-10-22 20:41:19

渣渣宇

Tags: RNN LSTM GRU

Article link: https://blog.csdn.net/qq_28835913/article/details/85616481

1 The Evolution of RNNs

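Language modeling is the task of predicting the next word: given a sequence of words $x^{\left ( 1 \right )},x^{\left ( 2 \right )},...,x^{\left ( t \right )}$, compute the probability distribution of the next word $x^{\left ( t+1 \right )}$, where $w_{j}$ is any word in the vocabulary: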

$p\left ( x^{\left ( t+1 \right )}=w_{j}|x^{\left ( t \right )},...,x^{\left ( 1 \right )} \right )$

1.1 n-gram Language Models

the students opened their ______

• unigrams: “the”, “students”, “opened”, “their”
• bigrams: “the students”, “students opened”, “opened their”
• trigrams: “the students opened”, “students opened their”
• 4-grams: “the students opened their”
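
The lists above can be reproduced with a sliding window over the tokens. This is a minimal sketch, assuming whitespace tokenization; the helper name `ngrams` is hypothetical and not from the original post.

```python
def ngrams(tokens, n):
    """Return every run of n consecutive tokens as a tuple."""
    # A sequence of length L contains L - n + 1 windows of size n
    # (and none at all when n > L).
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


tokens = "the students opened their".split()

# Print the unigrams, bigrams, trigrams and 4-grams of the example phrase.
for n in range(1, 5):
    print(n, [" ".join(gram) for gram in ngrams(tokens, n)])
```

A four-word phrase yields four unigrams, three bigrams, two trigrams and one 4-gram, matching the bullets above.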

Idea: collect statistics on how often different n-grams occur, and use them to predict the next word.
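
Written out, this idea is usually stated as follows. This is a sketch of the standard n-gram formulation, not a formula taken from the post: it assumes that the next word depends only on the preceding $n-1$ words, and $\mathrm{count}(\cdot)$ is notation introduced here for the number of times a word sequence occurs in the training corpus.

$p\left ( x^{\left ( t+1 \right )}|x^{\left ( t \right )},...,x^{\left ( 1 \right )} \right )\approx p\left ( x^{\left ( t+1 \right )}|x^{\left ( t \right )},...,x^{\left ( t-n+2 \right )} \right )\approx \frac{\mathrm{count}\left ( x^{\left ( t-n+2 \right )},...,x^{\left ( t \right )},x^{\left ( t+1 \right )} \right )}{\mathrm{count}\left ( x^{\left ( t-n+2 \right )},...,x^{\left ( t \right )} \right )}$

For a 4-gram model ($n=4$), the conditioning context is the three words $x^{\left ( t-2 \right )},x^{\left ( t-1 \right )},x^{\left ( t \right )}$.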

Example:

Suppose we are learning a 4-gram language model.

~~as the proctor started the clock, the~~ students opened their _____

In the corpus:
• "students opened their" occurred 1000 times
• "students opened their books" occurred 400 times
• P(books | students opened their) = 0.4
• "students opened their exams" occurred 100 times
• P(exams | students opened their) = 0.1
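
The arithmetic behind these probabilities is a ratio of counts. The sketch below uses only the counts quoted above; the names `context_count`, `continuation_counts` and `cond_prob` are hypothetical, and the word "laptops" is an illustrative continuation that is assumed never to appear after this context.

```python
from collections import Counter

# Counts quoted in the example above.
context_count = 1000  # occurrences of "students opened their"
continuation_counts = Counter({"books": 400, "exams": 100})


def cond_prob(word, continuation_counts, context_count):
    """Estimate P(word | context) as count(context + word) / count(context)."""
    # Counter returns 0 for a word never seen after this context.
    return continuation_counts[word] / context_count


print(cond_prob("books", continuation_counts, context_count))
print(cond_prob("exams", continuation_counts, context_count))
print(cond_prob("laptops", continuation_counts, context_count))
```

A continuation that was never observed after the context gets an estimate of exactly zero, however plausible it is.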

Should we have discarded the “proctor” context?

Problems:

Sparsity

If "students opened their $w_{j}$" occurs 0 times in the corpus, $w_{j}$ is assigned probability 0 -> add a small …
