Backpropagation in deep learning

loss function:

C = \tfrac{1}{2}(y^n - y^t)^2
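As a quick numeric check of this loss, here is a minimal Python sketch. It assumes y^n is the network's output and y^t the target value (the post does not spell this out), and the function names `loss` and `dloss_dy` are only illustrative.

```python
def loss(y_n, y_t):
    """Squared-error loss C = 0.5 * (y_n - y_t)**2 for one scalar output."""
    return 0.5 * (y_n - y_t) ** 2


def dloss_dy(y_n, y_t):
    """Derivative dC/dy_n = y_n - y_t; the factor 0.5 cancels the exponent 2."""
    return y_n - y_t


print(loss(3.0, 1.0))      # 2.0
print(dloss_dy(3.0, 1.0))  # 2.0
```

The derivative with respect to the output is the starting point of the backward pass at the last layer.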

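For gradient descent, the gradient with respect to each weight factors, by the chain rule, into a backward term and a forward term:

\tfrac{\partial C}{\partial w_{ij}^l} = \tfrac{\partial C}{\partial z_j^l} \cdot \tfrac{\partial z_j^l}{\partial w_{ij}^l}

The first factor, \partial C / \partial z_j^l, is computed in the backward pass; the second, \partial z_j^l / \partial w_{ij}^l, comes from the forward pass.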
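To make the product concrete, here is a small sketch of how the two factors combine into a full gradient matrix for one layer. It assumes the forward factor is the layer input x_i (as the forward-pass section derives) and that the backward factors \partial C / \partial z_j are already known; `weight_grad` and the sample numbers are made up for illustration.

```python
def weight_grad(x, delta):
    """Return grad with grad[i][j] = dC/dw_ij = x[i] * delta[j].

    x     : inputs to the layer (forward factor), length n_in
    delta : dC/dz for the layer (backward factor), length n_out
    """
    return [[x_i * d_j for d_j in delta] for x_i in x]


x = [1.0, 2.0]              # hypothetical layer inputs
delta = [0.5, -1.0, 0.25]   # hypothetical dC/dz values
print(weight_grad(x, delta))  # [[0.5, -1.0, 0.25], [1.0, -2.0, 0.5]]
```

In other words, the gradient for a whole layer is the outer product of its input vector and its backward vector.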

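Forward pass:

\begin{aligned} z^l &= (w^l)^{\top} x^l + b^l \\ a^l &= \sigma(z^l) \\ x^l &= a^{l-1} \end{aligned}

Here w_{ij}^l is the weight from unit i of layer l-1 to unit j of layer l, so component-wise z_j^l = \sum_i w_{ij}^l x_i^l + b_j^l, and therefore

\tfrac{\partial z_j^l}{\partial w_{ij}^l} = x_i^l = a_i^{l-1}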
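The forward pass for a single layer can be sketched in plain Python as follows. This assumes \sigma is the logistic sigmoid (the post only writes \sigma) and stores the weights as `w[i][j]`, from input i to unit j; `layer_forward` is an illustrative name, not something from the course material.

```python
import math


def sigmoid(z):
    """Logistic sigmoid, assumed here as the activation sigma."""
    return 1.0 / (1.0 + math.exp(-z))


def layer_forward(w, b, x):
    """Compute z_j = sum_i w[i][j] * x[i] + b[j] and a_j = sigmoid(z_j)."""
    z = [sum(w[i][j] * x[i] for i in range(len(x))) + b[j]
         for j in range(len(b))]
    a = [sigmoid(z_j) for z_j in z]
    return z, a


w = [[0.0, 1.0],
     [0.0, -1.0]]   # 2 inputs -> 2 units
b = [0.0, 0.0]
x = [1.0, 1.0]
z, a = layer_forward(w, b, x)
print(z)  # [0.0, 0.0]
print(a)  # [0.5, 0.5]
```

The output `a` of one layer is then fed in as `x` for the next layer, which is exactly the relation x^l = a^{l-1}.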

Backward pass:

\tfrac{\partial C}{\partial z^l} = \left[ \tfrac{\partial C}{\partial a_1^l} \, \sigma'(z_1^l), \ldots, \tfrac{\partial C}{\partial a_n^l} \, \sigma'(z_n^l) \right]

\tfrac{\partial C}{\partial z^{l-1}} = \left( w^l \, \tfrac{\partial C}{\partial z^l} \right) \odot \sigma'(z^{l-1})

where \odot denotes the element-wise product.

In other words, going backward from layer l to layer l-1 means multiplying by w^l and then by \sigma'(z^{l-1}).
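One step of this backward recursion can be sketched like this. As before, the sigmoid activation and the `w[j][i]` storage (from unit j of layer l-1 to unit i of layer l) are assumptions of the sketch, and `backprop_delta` is a made-up helper name.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def sigmoid_prime(z):
    """Derivative of the logistic sigmoid: s * (1 - s)."""
    s = sigmoid(z)
    return s * (1.0 - s)


def backprop_delta(w, delta, z_prev):
    """Map dC/dz of layer l (delta) to dC/dz of layer l-1.

    delta_prev[j] = (sum_i w[j][i] * delta[i]) * sigma'(z_prev[j])
    """
    return [sum(w[j][i] * delta[i] for i in range(len(delta)))
            * sigmoid_prime(z_prev[j])
            for j in range(len(z_prev))]


w = [[1.0, 2.0],
     [3.0, 4.0],
     [5.0, 6.0]]            # 3 units in layer l-1, 2 units in layer l
delta = [1.0, -1.0]         # hypothetical dC/dz for layer l
z_prev = [0.0, 0.0, 0.0]    # pre-activations of layer l-1
print(backprop_delta(w, delta, z_prev))  # [-0.25, -0.25, -0.25]
```

Each weighted sum here is -1.0 and \sigma'(0) = 0.25, which gives the three identical entries.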

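Note: the screenshots are from the course slides of Professor Hung-yi Lee (李宏毅) at National Taiwan University; they were very helpful to my own understanding of BP.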

Detailed derivation of the backward pass:

\begin{aligned} \tfrac{\partial C}{\partial z_j^l} &= \tfrac{\partial C}{\partial a_j^l} \cdot \tfrac{\partial a_j^l}{\partial z_j^l} \\ &= {\color{red} \tfrac{\partial C}{\partial a_j^l} \, \sigma'(z_j^l)} \end{aligned}

In vector form:

\tfrac{\partial C}{\partial z^l} = \left[ \tfrac{\partial C}{\partial a_1^l} \, \sigma'(z_1^l), \ldots, \tfrac{\partial C}{\partial a_n^l} \, \sigma'(z_n^l) \right]

\begin{aligned} \tfrac{\partial C}{\partial z_j^{l-1}} &= \tfrac{\partial C}{\partial a_j^{l-1}} \cdot \tfrac{\partial a_j^{l-1}}{\partial z_j^{l-1}} \\ &= \sum_{i=1}^{n^l} \tfrac{\partial C}{\partial a_i^l} \cdot \tfrac{\partial a_i^l}{\partial a_j^{l-1}} \cdot \tfrac{\partial a_j^{l-1}}{\partial z_j^{l-1}} \\ &= \sum_{i=1}^{n^l} \tfrac{\partial C}{\partial a_i^l} \cdot \tfrac{\partial \, \sigma\left( \sum_k w_{ki}^l a_k^{l-1} + b_i^l \right)}{\partial a_j^{l-1}} \cdot \sigma'(z_j^{l-1}) \\ &= \sum_{i=1}^{n^l} {\color{red} \tfrac{\partial C}{\partial a_i^l} \, \sigma'(z_i^l)} \, w_{ji}^l \, \sigma'(z_j^{l-1}) \\ &= \left[ w_{j1}^l, \ldots, w_{jn^l}^l \right] \cdot \left[ \tfrac{\partial C}{\partial a_1^l} \, \sigma'(z_1^l), \ldots, \tfrac{\partial C}{\partial a_{n^l}^l} \, \sigma'(z_{n^l}^l) \right]^{\top} \sigma'(z_j^{l-1}) \\ &= \left[ w_{j1}^l, \ldots, w_{jn^l}^l \right] \cdot \tfrac{\partial C}{\partial z^l} \cdot \sigma'(z_j^{l-1}) \end{aligned}

Stacking these over all j gives the vector form:

\tfrac{\partial C}{\partial z^{l-1}} = \left( w^l \, \tfrac{\partial C}{\partial z^l} \right) \odot \sigma'(z^{l-1})
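One way to convince yourself that the derivation holds is a gradient check: compute the weight gradients with the backward recursion and compare them with central finite differences. The sketch below does this for a tiny two-layer network. The sigmoid activation, the `w[i][j]` storage (input i to unit j), the network sizes, and all numbers are assumptions chosen for illustration.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def forward(ws, bs, x):
    """Return the input to every layer (xs) and the final output a."""
    xs, a = [], x
    for w, b in zip(ws, bs):
        xs.append(a)
        z = [sum(w[i][j] * a[i] for i in range(len(a))) + b[j]
             for j in range(len(b))]
        a = [sigmoid(v) for v in z]
    return xs, a


def cost(ws, bs, x, y_t):
    _, a = forward(ws, bs, x)
    return sum(0.5 * (a_k - t_k) ** 2 for a_k, t_k in zip(a, y_t))


def backprop(ws, bs, x, y_t):
    """Return grads with grads[l][i][j] = dC/dw_ij for layer l."""
    xs, a = forward(ws, bs, x)
    # Output layer: dC/dz_j = dC/da_j * sigma'(z_j), with sigma' = a * (1 - a).
    delta = [(a_j - t_j) * a_j * (1.0 - a_j) for a_j, t_j in zip(a, y_t)]
    grads = [None] * len(ws)
    for l in range(len(ws) - 1, -1, -1):
        # Forward factor (layer input) times backward factor (delta).
        grads[l] = [[x_i * d_j for d_j in delta] for x_i in xs[l]]
        if l > 0:
            # xs[l] is sigmoid(z) of the previous layer, so sigma'(z) = x * (1 - x).
            delta = [sum(ws[l][j][i] * delta[i] for i in range(len(delta)))
                     * xs[l][j] * (1.0 - xs[l][j])
                     for j in range(len(xs[l]))]
    return grads


def numeric_grad(ws, bs, x, y_t, l, i, j, eps=1e-6):
    """Central finite difference for dC/dw_ij in layer l."""
    old = ws[l][i][j]
    ws[l][i][j] = old + eps
    c_plus = cost(ws, bs, x, y_t)
    ws[l][i][j] = old - eps
    c_minus = cost(ws, bs, x, y_t)
    ws[l][i][j] = old
    return (c_plus - c_minus) / (2 * eps)


ws = [[[0.1, -0.2, 0.3], [0.4, 0.5, -0.6]],   # layer 0: 2 inputs -> 3 units
      [[0.7], [-0.8], [0.9]]]                 # layer 1: 3 units -> 1 output
bs = [[0.0, 0.1, -0.1], [0.2]]
x, y_t = [1.0, -1.0], [1.0]

grads = backprop(ws, bs, x, y_t)
max_err = max(abs(grads[l][i][j] - numeric_grad(ws, bs, x, y_t, l, i, j))
              for l in range(len(ws))
              for i in range(len(ws[l]))
              for j in range(len(ws[l][i])))
print(max_err < 1e-6)  # True
```

If the recursion used \partial C / \partial w^l in place of w^l, this check would fail, which makes it a handy sanity test when implementing BP by hand.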
