Dropout, continued

Huang Faliang's blog

Category: RNN

Article link: https://blog.csdn.net/falianghuang/article/details/73332000

Dropout vs. L2 vs. ensemble learning

Ensemble learning that uses a different set of hidden units in every iteration (that is, dropout) performs better than ensemble learning that uses the same set of hidden units throughout training.
Note that overfitting did not occur even though dropout learning used more hidden units than ensemble learning.
L2 and dropout give comparable regularization. With SGD + L2 the learning rate α has to be tuned by repeated trial, whereas dropout has no corresponding parameter to fine-tune.
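The contrast between the three setups can be made concrete with a small sketch. This is only an illustration under my own assumptions, not code from any of the cited work: the function names (`sample_keep_mask`, `training_masks`, `sgd_step_l2`) and the `l2` coefficient are hypothetical. Dropout redraws the set of active hidden units at every iteration, a fixed sub-network keeps one set for the whole run, and L2 just adds a decay term to the SGD update.

```python
import random


def sample_keep_mask(n_hidden, drop_rate, rng):
    """Return a 0/1 mask over hidden units; 0 marks a dropped unit."""
    return [0 if rng.random() < drop_rate else 1 for _ in range(n_hidden)]


def training_masks(n_hidden, drop_rate, n_iters, resample, seed=0):
    """Masks used over a training run.

    resample=True  -> a fresh mask every iteration (dropout).
    resample=False -> one mask reused throughout (a fixed set of hidden units).
    """
    rng = random.Random(seed)
    fixed = sample_keep_mask(n_hidden, drop_rate, rng)
    return [
        sample_keep_mask(n_hidden, drop_rate, rng) if resample else fixed
        for _ in range(n_iters)
    ]


def sgd_step_l2(weights, grads, lr, l2):
    """One SGD step on loss + (l2 / 2) * sum(w^2): w <- w - lr * (g + l2 * w)."""
    return [w - lr * (g + l2 * w) for w, g in zip(weights, grads)]


dropout_masks = training_masks(n_hidden=8, drop_rate=0.5, n_iters=20, resample=True)
fixed_masks = training_masks(n_hidden=8, drop_rate=0.5, n_iters=20, resample=False)
```

In `fixed_masks` every entry is the same mask, while `dropout_masks` varies from iteration to iteration, which is the difference the comparison above is about.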

Selective Dropout

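Reference: Barrow E, Eastwood M, Jayne C. Selective Dropout for Deep Neural Networks[M]// Neural Information Processing. Springer International Publishing, 2016.
Method: the dropout rate determines how many units to drop in each layer. Each of the following three values is then used to produce a selection probability for every unit; the larger the value, the more …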

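Weight change: $\mathrm{avg}_k = \frac{1}{n}\sum\limits_{j = 1}^{n} |W_{jk}^{(i)} - W_{jk}^{(i - 1)}|$. A larger change indicates that the unit is still actively learning, so its dropout probability should be lower.
Mean weight: $\mathrm{avg}_k = \frac{1}{n}\sum\limits_{j = 1}^{n} W_{jk}^{(i)}$. A larger value means that the corresponding neuron has largely finished learning, so its dropout probability should be higher.
Output variance: $\mathrm{N\_Variance}_k = \mathrm{variance}(X_k^{(i - 1)})$. A larger value means that the unit is largely stable, so its dropout probability should be higher.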
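The three statistics and the selection step might be sketched as follows. Several things here are my assumptions rather than details from the paper: weights are stored as `W[j][k]` (row `j` over the `n` connections, column `k` for the unit), the variance is the population variance, and `select_units_to_drop` turns scores into probabilities by shifting them to be non-negative and sampling without replacement. The paper's actual mapping from scores to probabilities may differ.

```python
import random


def weight_change(W_curr, W_prev, k):
    """Mean absolute change of the n weights W[j][k] between two iterations."""
    n = len(W_curr)
    return sum(abs(W_curr[j][k] - W_prev[j][k]) for j in range(n)) / n


def weight_mean(W_curr, k):
    """Mean of the n weights W[j][k] at the current iteration."""
    n = len(W_curr)
    return sum(W_curr[j][k] for j in range(n)) / n


def output_variance(outputs_k):
    """Population variance of unit k's outputs from the previous iteration."""
    m = sum(outputs_k) / len(outputs_k)
    return sum((x - m) ** 2 for x in outputs_k) / len(outputs_k)


def select_units_to_drop(scores, drop_rate, higher_more_likely=True, rng=None):
    """Pick round(drop_rate * n_units) distinct units, weighted by score.

    Hypothetical scheme: higher_more_likely=False inverts the weighting,
    as wanted for the weight-change statistic.
    """
    rng = rng or random.Random(0)
    eps = 1e-12  # keeps every sampling weight strictly positive
    lo, hi = min(scores), max(scores)
    if higher_more_likely:
        weights = [s - lo + eps for s in scores]
    else:
        weights = [hi - s + eps for s in scores]
    n_drop = round(drop_rate * len(scores))
    candidates = list(range(len(scores)))
    dropped = []
    for _ in range(n_drop):
        pick = rng.choices(candidates, weights=[weights[c] for c in candidates], k=1)[0]
        dropped.append(pick)
        candidates.remove(pick)  # sample without replacement
    return sorted(dropped)
```

For example, with `W_prev = [[0.0, 1.0], [0.0, 1.0]]` and `W_curr = [[0.5, 1.0], [-0.5, 1.0]]`, unit 0 has a weight change of 0.5 and unit 1 has 0, so under the weight-change criterion unit 1 would be the more likely one to be dropped.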
