This post works through the original ResNet paper (https://arxiv.org/pdf/1512.03385.pdf), reproduces it in TensorFlow, and closes with the modifications that later researchers have built on top of ResNet.
1.Motivation
Degradation Problem
The paper starts from the observation that as layers are stacked, accuracy saturates and then degrades rapidly, which suggests that not all systems are similarly easy to optimize.
2.Solution
To address this degradation problem, the paper introduces a deep residual learning framework.
Instead of hoping the stacked layers directly fit a desired underlying mapping H(x), the paper lets them fit the residual mapping F(x) = H(x) − x. The original mapping is then recast as F(x) + x.
The paper hypothesizes that it is easier to optimize the residual mapping than the original, unreferenced mapping.
Shortcut connections [2, 34, 49] are those skipping one or more layers. In our case, the shortcut connections simply perform identity mapping, and their outputs are added to the outputs of the stacked layers (Fig. 2).
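A minimal sketch of the basic two-layer residual block in tf.keras (TF 2.x), assuming the post-activation arrangement (conv → BN → ReLU) of the original paper; the helper name `residual_block` is my own:

```python
import tensorflow as tf
from tensorflow.keras import layers

def residual_block(x, filters):
    """Basic residual block with an identity shortcut (Fig. 2 of the paper).

    The stacked layers fit F(x); the block outputs F(x) + x.
    """
    shortcut = x  # identity mapping: adds no extra parameters

    # Two 3x3 conv layers fit the residual function F(x)
    y = layers.Conv2D(filters, 3, padding="same", use_bias=False)(x)
    y = layers.BatchNormalization()(y)
    y = layers.ReLU()(y)
    y = layers.Conv2D(filters, 3, padding="same", use_bias=False)(y)
    y = layers.BatchNormalization()(y)

    # Element-wise addition of the shortcut, then the final ReLU
    y = layers.Add()([y, shortcut])
    return layers.ReLU()(y)
```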
3.Advantages
- Easy to optimize, but the counterpart “plain” nets (that simply stack layers) exhibit higher training error when the depth increases;
- Easily enjoy accuracy gains from greatly increased depth, producing results substantially better than previous networks.
4.Network
When the dimensions increase (dotted-line shortcuts in Fig. 3), the paper considers two options:
- (A) The shortcut still performs identity mapping, with extra zero entries padded for the increased dimensions. This option introduces no extra parameters;
- (B) The projection shortcut in Eqn.(2) is used to match dimensions (done by 1×1 convolutions).

For both options, when the shortcuts go across feature maps of two sizes, they are performed with a stride of 2.
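A tf.keras sketch of both dimension-matching options; the function name and the `option` flag are my own, and option A assumes the channel count doubles across the stage (as it does between ResNet stages):

```python
import tensorflow as tf
from tensorflow.keras import layers

def downsample_block(x, filters, option="B"):
    """Residual block whose shortcut crosses feature maps of two sizes.

    Both the residual branch and the shortcut use stride 2 so spatial
    dimensions match before the addition.
    """
    # Residual branch F(x): first conv downsamples with stride 2
    y = layers.Conv2D(filters, 3, strides=2, padding="same", use_bias=False)(x)
    y = layers.BatchNormalization()(y)
    y = layers.ReLU()(y)
    y = layers.Conv2D(filters, 3, padding="same", use_bias=False)(y)
    y = layers.BatchNormalization()(y)

    if option == "A":
        # Option A: identity shortcut, subsampled with stride 2, with the
        # channel dimension zero-padded. No extra parameters. Assumes
        # filters is larger than the input channel count by an even amount.
        in_channels = x.shape[-1]
        shortcut = layers.MaxPooling2D(pool_size=1, strides=2)(x)
        pad = (filters - in_channels) // 2
        shortcut = tf.pad(shortcut, [[0, 0], [0, 0], [0, 0], [pad, pad]])
    else:
        # Option B: projection shortcut of Eqn.(2), a 1x1 conv with stride 2
        shortcut = layers.Conv2D(filters, 1, strides=2, use_bias=False)(x)
        shortcut = layers.BatchNormalization()(shortcut)

    return layers.ReLU()(layers.Add()([y, shortcut]))
```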