Preface
Assignment 3 has you build a neural dependency parser while getting familiar with PyTorch.
The written part covers Adam and dropout. The professor spent relatively little lecture time on these topics, but they are core pieces of neural network training, so reading the related literature to deepen your understanding is recommended.
The coding part applies the optimizer tricks from the written part to build a complete simple neural network and train the model.
Assignment Details
– Written Part –
#1. Machine Learning & Neural Networks (8 points)
Answer:
( a )
i. Because m is an exponential moving average of past gradients (each new gradient enters with weight 1−β, and β is close to 1), any single noisy minibatch gradient contributes only a small fraction of each update. This smooths the variance across minibatches, so updates point in a more consistent direction and oscillate less than plain SGD.
ii. Since v accumulates the squares of the gradients, dividing the update by √v gives parameters with small historical gradients larger effective steps and parameters with large historical gradients smaller ones. This normalization keeps the effective learning rate from being too large (exploding) in steep directions or too small (vanishing) in flat ones, which helps learning proceed at a similar pace across all parameters.
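To make the interaction of m and v concrete, here is a minimal, bias-correction-free Adam-style step on a scalar parameter; the function name and hyperparameter defaults are illustrative, not the assignment's official code:

```python
import math

def adam_step(theta, grad, m, v, lr=0.1, beta1=0.9, beta2=0.999, eps=1e-8):
    """One simplified Adam update (bias correction omitted for clarity)."""
    m = beta1 * m + (1 - beta1) * grad       # momentum: EMA of gradients
    v = beta2 * v + (1 - beta2) * grad ** 2  # EMA of squared gradients
    theta = theta - lr * m / (math.sqrt(v) + eps)  # adaptive step via 1/sqrt(v)
    return theta, m, v

# Minimize f(theta) = theta^2 (gradient is 2 * theta) from theta = 5.0.
theta, m, v = 5.0, 0.0, 0.0
for _ in range(200):
    theta, m, v = adam_step(theta, 2 * theta, m, v)
```

Note that because m/√v is roughly the sign of the gradient when gradients are consistent, each step has magnitude near lr regardless of how steep the loss is, which is exactly the normalization discussed above.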
( b )
i. γ = 1 / (1 − p_drop).
Since
h_drop = γ d ⊙ h
∵ E[h_drop] = γ(1 − p_drop) ⊙ h = h (taking the expectation over the dropout mask d, where each entry survives with probability 1 − p_drop)
∴ γ(1 − p_drop) = 1, i.e. γ = 1 / (1 − p_drop).
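The derivation above can be checked numerically: scaling the surviving units by γ = 1/(1 − p_drop) keeps the expected activation equal to h. The helper below is a hypothetical sketch, not part of the assignment code:

```python
import random

def inverted_dropout(h, p_drop):
    """Inverted dropout: zero each unit with probability p_drop and
    scale survivors by gamma = 1/(1 - p_drop), so E[h_drop] = h."""
    gamma = 1.0 / (1.0 - p_drop)
    return [gamma * x if random.random() >= p_drop else 0.0 for x in h]

# Empirical check: averaging many dropped-out copies of h = [1.0]
# with p_drop = 0.5 should recover the original value.
random.seed(0)
mean = sum(inverted_dropout([1.0], 0.5)[0] for _ in range(20000)) / 20000
```

Each sample is either 0 or γ = 2, so the average converges to 1.0, matching E[h_drop] = h in the derivation.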