[Computer Science] [2019.06] Greedy Layerwise Training of Convolutional Neural Networks


This is a master's thesis from the Massachusetts Institute of Technology (author: Loc Quang Trinh), 63 pages in total.

Layerwise training presents an alternative approach to end-to-end back-propagation for training deep convolutional neural networks. Although previous work was unsuccessful in demonstrating the viability of layerwise training, especially on large-scale datasets such as ImageNet, recent work has shown that layerwise training on specific architectures can yield highly competitive performance. On ImageNet, layerwise-trained networks can perform comparably to many state-of-the-art end-to-end trained networks. In this thesis, we compare the performance gap between the two training procedures across a wide range of network architectures and further analyze the possible limitations of layerwise training. Our results show that layerwise training quickly saturates after a certain critical layer, due to the overfitting of early layers within the networks. We discuss several approaches we took to address this issue and help layerwise training improve across multiple architectures. From a fundamental standpoint, this study emphasizes the need to open the black box that is modern deep neural networks and investigate the layerwise interactions between intermediate hidden layers within deep networks, all through the lens of layerwise training.
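To make the procedure concrete, the sketch below illustrates the general greedy layerwise scheme the abstract describes: each convolutional block is trained through a temporary auxiliary classifier head while all previously trained blocks stay frozen, then frozen itself before the next block is stacked on top. This is a minimal PyTorch illustration, not the thesis code; the toy architecture, channel widths, hyperparameters, and the `loader` of (image, label) batches are all assumptions.

```python
# Minimal sketch of greedy layerwise training (illustrative only, not the
# thesis implementation). Assumes `loader` yields (image, label) batches.
import torch
import torch.nn as nn

def conv_block(c_in, c_out):
    """One trainable 'layer': conv -> batchnorm -> ReLU -> downsample."""
    return nn.Sequential(
        nn.Conv2d(c_in, c_out, kernel_size=3, padding=1),
        nn.BatchNorm2d(c_out),
        nn.ReLU(inplace=True),
        nn.MaxPool2d(2),
    )

def train_layerwise(loader, channels=(3, 64, 128, 256),
                    num_classes=10, epochs_per_layer=5):
    blocks = [conv_block(channels[i], channels[i + 1])
              for i in range(len(channels) - 1)]
    loss_fn = nn.CrossEntropyLoss()
    for k, block in enumerate(blocks):
        frozen = nn.Sequential(*blocks[:k]).eval()  # already-trained prefix
        # Temporary auxiliary head: pooled features -> linear classifier.
        head = nn.Sequential(nn.AdaptiveAvgPool2d(1), nn.Flatten(),
                             nn.Linear(channels[k + 1], num_classes))
        opt = torch.optim.SGD(
            list(block.parameters()) + list(head.parameters()),
            lr=0.01, momentum=0.9)
        for _ in range(epochs_per_layer):
            for x, y in loader:
                with torch.no_grad():       # no gradients reach earlier blocks
                    x = frozen(x)
                loss = loss_fn(head(block(x)), y)
                opt.zero_grad()
                loss.backward()             # updates only this block + its head
                opt.step()
        for p in block.parameters():        # freeze before stacking the next
            p.requires_grad_(False)
    # Final classifier: the frozen feature stack plus the last auxiliary head.
    return nn.Sequential(*blocks, head)
```

A practical side effect visible in the sketch: back-propagation never extends past the block currently being trained, so each training step only ever computes one block's worth of gradients, regardless of how deep the stacked network has grown.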

1. Introduction
2. Layerwise Training
3. Understanding Learned Representations
4. Limitations of Layerwise Training
5. Slowing Down Training
6. Conclusion
