斯坦福cs224课程笔记

最新推荐文章于 2024-07-03 14:44:29 发布

cookie_17

最新推荐文章于 2024-07-03 14:44:29 发布

阅读量1.4k

点赞数

分类专栏：深度学习 nlp

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.csdn.net/u012892939/article/details/79442108

版权

深度学习同时被 2 个专栏收录

10 篇文章 1 订阅

订阅专栏

5 篇文章 1 订阅

订阅专栏

CS224N: Natural Language Processing with Deep Learning

课程主页： CS224n: Natural Language Processing with Deep Learning

http://web.stanford.edu/class/cs224n/syllabus.html

Lecture 6: Dependency Parsing

Lecture 7: Tensorflow Tutorial

Lecture 8: RNN

vanishing gradient：initialization + Relu

exploding gradient：clipping trick

beam search

bidirectional RNN：双向rnn

Lecture 9: Fancy Recurrent Neural Networks for Machine Translation

MT：machine translation

机器翻译原始模型：

改进：

Encoder和Decoder使用不同的W，不再共享
解码过程，最后一个时刻的c，要传送给decoder的每个时刻的ht和yt;

同时，当前时刻的输出y t 也要传给下一时刻的h t+1 和y t +1

因此，encoder的h t 取决于c、h t-1、 y t-1

deep RNN with multiple layers
bidirectional encoder
把输入序列反转

Lecture 10: Midterm recap

SGD：

Lecture 11: machine translation and models with attention

相比于传统机器翻译，NMT的优点：

Attention

Global VS Local：

Beam Search

https://zhuanlan.zhihu.com/p/28048246

- 只用在test阶段，train和valid不用；

- 用在seq2seq生成模型中 decode阶段，要使生成序列的概率最大，取每个时刻上概率最大不可行；

- 贪心的思想；在每个时刻，对每个已有序列，对词典中所有的词计算概率，取概率最大的前k个序列；

- 结果不一定是全局最优；

greedy search： 贪婪地选取当前最可能的那个单词

解决Large-vocab问题

（下一讲最后）

vocab太大，每次预测概率时求softmax太耗时

解决办法：

1.segment data：

2.Select candidate words

Lecture 12: Further topics in Neural Machine Translation and Recurrent Models

LSTM

Lecture 13: End-to-end models for Speech Processing

Lecture 14：Convolutional Neural Networks (for NLP)

Lecture 15：Tree Recursive Neural Networks and Constituency Parsing

treeRNN

Lecture 16：Coreference Resolution

指代消解

Lecture 17：Dynamic Neural Networks for Question Answering

Lecture 18：Issues in NLP and Possible Architectures for NLP

Lecture 19：Tackling the Limits of Deep Learning for NLP

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
斯坦福cs224课程笔记

CS224N: Natural Language Processing with Deep Learning课程主页：CS224n: Natural Language Processing with Deep Learninghttp://web.stanford.edu/class/cs224n/syllabus.htmlLecture 6: Dependency ParsingLecture ...
复制链接

扫一扫

专栏目录

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。