CS224N notes_chapter11_Review GRU & LSTM

Lecture 11: Review GRU & LSTM

The original video also covers some other MT topics, which I have omitted here.

GRU

Idea: use shortcut connections to keep gradients from vanishing, and make those shortcuts adaptive through an update gate $u_t$.

$$
\begin{aligned}
f(h_{t-1},x_t) =& u_t\odot \hat h_t + (1-u_t)\odot h_{t-1} \\
\hat h_t =& \tanh(W[x_t]+Uh_{t-1}+b) \\
u_t =& \sigma(W_u[x_t]+U_uh_{t-1}+b_u)
\end{aligned}
$$
Idea: adaptively prune unnecessary connections through a reset gate $r_t$.

$$
\begin{aligned}
\hat h_t =& \tanh(W[x_t]+U(r_t\odot h_{t-1})+b) \\
r_t =& \sigma(W_r[x_t]+U_r h_{t-1}+b_r) \\
u_t =& \sigma(W_u[x_t]+U_uh_{t-1}+b_u)
\end{aligned}
$$
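
The GRU equations above can be sketched as a single time step in NumPy. This is only a minimal illustration, not a reference implementation: the helper names `sigmoid`, `init_params` and `gru_step`, the parameter dictionary layout, and the small random initialization are all assumptions made for the example.

```python
import numpy as np

def sigmoid(z):
    """Logistic function, applied elementwise."""
    return 1.0 / (1.0 + np.exp(-z))

def init_params(input_dim, hidden_dim, seed=0):
    """Hypothetical helper: small random weights and zero biases."""
    rng = np.random.default_rng(seed)
    p = {}
    for suffix in ("", "_u", "_r"):  # candidate, update gate, reset gate
        p["W" + suffix] = rng.normal(scale=0.1, size=(hidden_dim, input_dim))
        p["U" + suffix] = rng.normal(scale=0.1, size=(hidden_dim, hidden_dim))
        p["b" + suffix] = np.zeros(hidden_dim)
    return p

def gru_step(x_t, h_prev, p):
    """One GRU step following the equations in these notes."""
    # Update gate u_t: how much of the new candidate to let in.
    u_t = sigmoid(p["W_u"] @ x_t + p["U_u"] @ h_prev + p["b_u"])
    # Reset gate r_t: which parts of h_{t-1} feed the candidate.
    r_t = sigmoid(p["W_r"] @ x_t + p["U_r"] @ h_prev + p["b_r"])
    # Candidate state h_hat_t.
    h_hat = np.tanh(p["W"] @ x_t + p["U"] @ (r_t * h_prev) + p["b"])
    # Adaptive shortcut: interpolate between candidate and previous state.
    return u_t * h_hat + (1.0 - u_t) * h_prev
```

When $u_t$ is close to 0 the step simply copies $h_{t-1}$ forward, which is the shortcut path that lets gradients flow across many time steps.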

Some tricks for training RNNs

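  1. Use an LSTM or GRU.
  2. Initialize recurrent matrices to be orthogonal.
  3. Initialize other matrices with a sensible (small) scale.
  4. Initialize the forget gate bias to 1, so the model defaults to remembering.
  5. Use an adaptive learning rate algorithm such as Adam or Adadelta.
  6. Clip the norm of the gradient.
  7. Apply dropout vertically (between layers) rather than along the recurrent connections.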
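
A few of the tricks above (orthogonal initialization, forget gate bias of 1, gradient norm clipping) can be sketched in NumPy as follows. The function names `orthogonal_init` and `clip_by_global_norm`, the QR-based construction, and the hidden size of 8 are assumptions chosen for illustration; deep learning frameworks ship their own versions of these utilities.

```python
import numpy as np

def orthogonal_init(n, seed=0):
    """Return an n x n orthogonal matrix via QR of a random Gaussian matrix."""
    rng = np.random.default_rng(seed)
    q, _ = np.linalg.qr(rng.normal(size=(n, n)))
    return q

def clip_by_global_norm(grads, max_norm):
    """Rescale a list of gradient arrays so their joint L2 norm is at most max_norm."""
    total = np.sqrt(sum(float(np.sum(g ** 2)) for g in grads))
    scale = min(1.0, max_norm / (total + 1e-12))
    return [g * scale for g in grads]

hidden_dim = 8                       # hypothetical hidden size
U = orthogonal_init(hidden_dim)      # trick 2: orthogonal recurrent matrix
b_forget = np.ones(hidden_dim)       # trick 4: forget gate bias starts at 1
```

Clipping rescales all gradients by the same factor, so the update direction is preserved and only its length is capped.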

Ensembling: combining the predictions of several trained models usually helps as well.

MT evaluation

  • Manual
  • Testing in an application that uses MT as one sub-component
  • Automatic metric
    • WER (word error rate)
    • BLEU (Bilingual Evaluation Understudy)
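
As a small illustration of an automatic metric, WER can be computed as the word-level edit distance between a hypothesis and a reference, divided by the reference length. The sketch below assumes whitespace tokenization and a single reference; the function name `wer` is chosen for this example only.

```python
def wer(reference, hypothesis):
    """Word error rate: word-level Levenshtein distance / reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] = edit distance between the ref words seen so far and hyp[:j]
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        cur = [i]
        for j, h in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if r == h else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = cur[j - 1] + 1   # extra word in hypothesis
            cur.append(min(substitution, deletion, insertion))
        prev = cur
    return prev[-1] / len(ref)
```

For example, `wer("the cat sat on the mat", "the cat sat on mat")` counts one deleted word against six reference words, giving 1/6.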