作业一:推导 Softmax 梯度

作业一:推导交叉熵 loss 下的 Softmax 梯度

Created: March 18, 2022 1:19 PM

L = − log ⁡ e s k ∑ j e s j L = - \log \frac{e^{s_k}}{\sum_j e^{s_j}} L=logjesjesk,求 ∂ L ∂ e s i \frac{\partial L}{\partial e^{s_i}} esiL

定义 p i = e s i ∑ j e s j p_i = \frac{e^{s_i}}{\sum_j e^{s_j}} pi=jesjesi

  • i = k i = k i=k:

    ∂ L ∂ e s i = − 1 p k ∂ p k ∂ s k = − 1 p k e s k ⋅ ∑ j e s j − e s k ⋅ e s k ( ∑ j e s j ) 2 = − 1 p k e s k ∑ j e s j ∑ j ≠ k e s j ∑ j e s j = − 1 p k e s k ∑ j e s j ( 1 − e s k ∑ j e s j ) = − 1 p k p k ( 1 − p k ) = p k − 1 \frac{\partial L}{\partial e^{s_i}} =- \frac{1}{p_k} \frac{\partial p_k}{\partial s_k} \\ =- \frac{1}{p_k} \frac{e^{s_k} \cdot \sum_j e^{s_j} - e^{s_k} \cdot e^{s_k}}{(\sum_j e^{s_j})^2} \\ =- \frac{1}{p_k} \frac{e^{s_k}}{\sum_j e^{s_j}} \frac{\sum_{j \neq k}e^{s_j}}{\sum_j e^{s_j}} \\ =- \frac{1}{p_k} \frac{e^{s_k}}{\sum_j e^{s_j}}(1-\frac{e^{s_k}}{\sum_j e^{s_j}}) \\ =- \frac{1}{p_k} p_k (1-p_k) \\ = p_k - 1 esiL=pk1skpk=pk1(jesj)2eskjesjeskesk=pk1jesjeskjesjj=kesj=pk1jesjesk(1jesjesk)=pk1pk(1pk)=pk1

  • i ≠ k i \neq k i=k
    ∂ L ∂ e s i = − 1 p i ∂ p i ∂ s i = − 1 p k − e s k e s i ( ∑ j e s j ) 2 = 1 p k e s k ∑ j e s j e s i ∑ j e s j = 1 p k ⋅ p k ⋅ p i = p i \frac{\partial L}{\partial e^{s_i}} =- \frac{1}{p_i} \frac{\partial p_i}{\partial s_i} \\=- \frac{1}{p_k} \frac{-e^{s_k} e^{s_i}}{(\sum_j e^{s_j})^2} \\=\frac{1}{p_k} \frac{e^{s_k}}{\sum_j e^{s_j}} \frac{e^{s_i}}{\sum_j e^{s_j}} \\=\frac{1}{p_k} \cdot p_k \cdot p_i \\= p_i esiL=pi1sipi=pk1(jesj)2eskesi=pk1jesjeskjesjesi=pk1pkpi=pi

  • 总结
    ∂ L ∂ e s i = { p k − 1 , i = k p i , i ≠ k \frac{\partial L}{\partial e^{s_i}} =\begin{cases}p_k - 1 , \quad i = k \\p_i , \quad i \neq k\end{cases} esiL={pk1,i=kpi,i=k

  • 0
    点赞
  • 2
    收藏
    觉得还不错? 一键收藏
  • 1
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论 1
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值