Classification Model:Resnet

最新推荐文章于 2024-04-30 15:11:08 发布

Grant|zgh

最新推荐文章于 2024-04-30 15:11:08 发布

阅读量338

点赞数

文章标签： pytorch 深度学习神经网络

本文链接：https://blog.csdn.net/weixin_42963486/article/details/106061562

版权

Since the convolutional neural network is applied to image classification, the structure of the network has continued to develop and optimize. From Lenet, AlexNet, VGGNet, to ResNet, the depth of the network is getting larger and larger.So here I want to arise a question :Do networks stacking more layers have a better effect of recognition and lower error?
Did you get you idea?The answer in the Resnet paper is No.
在这里插入图片描述
From the figure above,we can get a conclusion that the deeper network has a higher error than the shallower one.So what caused this result?

An obstacle to answering this question was the notorious problem of vanishing/exploding gradients， which hamper convergence from the beginning.（When training a very deep neural network, as the number of layers increases, there will be an exponential drop in the gradient transmission during the back propagation process, which will cause the gradient to disappear. Or an exponential increase, leading to a gradient explosion）.

This problem,however, has been largely addressed by normalized initialization and intermediate normalization layers, which enable networks with tens of layers to start converging for stochastic gradient descent (SGD) with backpropagation。

The biggest problem caused by the increasing depth of the network is degradation.with the network depth increasing, accuracy gets saturated (which might be unsurprising) and then degrades rapidly. Unexpectedly,such degradation is not caused by overfitting, and adding more layers to a suitably deep model leads to higher training error.

In the paper,the writer successfully tackle the degradation problem by introducing a deep residual learning framework as followes.

在这里插入图片描述
According to whether the number of input and output channels of shortcut is the same as a residual mapping ,the writer propose two residual modules .

The Resnet can have many layers.According to the different layers, the residual network can be divided into five categories
在这里插入图片描述
Then the construction of the residual network is as follows.

The basic block and bottleneck:

#resnet18 and resnet34
class BasicBlock(nn.Module):
    expansion = 1

    def __init__(self, in_channel, out_channel, stride=1, downsample=None):
        super(BasicBlock, self).__init__()
        self.conv1 = nn.Conv2d(in_channels=in_channel, out_channels=out_channel,
                               kernel_size=3, stride=stride, padding=1, bias=False)
        self.bn1 = nn.BatchNorm2d(out_channel)
        self.relu = nn.ReLU()
        self.conv2 = nn.Conv2d(in_channels=out_channel, out_channels=out_channel,
                               kernel_size=3, stride=1, padding=1, bias=False)
        self.bn2 = nn.BatchNorm2d(out_channel)
        self.downsample = downsample

    def forward(self, x):
        identity = x
        if self.downsample is not None:
            identity = self.downsample(x)

        out = self.conv1(x)
        out = self.bn1(out)
        out = self.relu(out)

        out = self.conv2(out)
        out = self.bn2(out)

        out += identity
        out = self.relu(out)

        return out

#resnet51、resnet101、resnet152
class Bottleneck(nn.Module):
    expansion = 4

    def __init__(self, in_channel, out_channel, stride=1, downsample=None):
        super(Bottleneck, self).__init__()
        self.conv1 = nn.Conv2d(in_channels=in_channel, out_channels=out_channel,
                               kernel_size=1, stride=1, bias=False)  # squeeze channels
        self.bn1 = nn.BatchNorm2d(out_channel)
        self.conv2 = nn.Conv2d(in_channels=out_channel, out_channels=out_channel,
                               kernel_size=3, stride=stride, bias=False, padding=1)
        self.bn2 = nn.BatchNorm2d(out_channel)

        self.conv3 = nn.Conv2d(in_channels=out_channel, out_channels=out_channel*self.expansion,
                               kernel_size=1, stride=1, bias=False)  # unsqueeze channels
        self.bn3 = nn.BatchNorm2d(out_channel*self.expansion)
        self.relu = nn.ReLU(inplace=True)
        self.downsample = downsample

    def forward(self, x):
        identity = x
        if self.downsample is not None:
            identity = self.downsample(x)

        out = self.conv1(x)
        out = self.bn1(out)
        out = self.relu(out)

        out = self.conv2(out)
        out = self.bn2(out)
        out = self.relu(out)

        out = self.conv3(out)
        out = self.bn3(out)

        out += identity
        out = self.relu(out)

        return out

The Resnet:

class ResNet(nn.Module):

    def __init__(self, block, blocks_num, num_classes=1000, include_top=True):
        super(ResNet, self).__init__()
        self.include_top = include_top
        self.in_channel = 64

        self.conv1 = nn.Conv2d(3, self.in_channel, kernel_size=7, stride=2,
                               padding=3, bias=False)
        self.bn1 = nn.BatchNorm2d(self.in_channel)
        self.relu = nn.ReLU(inplace=True)
        self.maxpool = nn.MaxPool2d(kernel_size=3, stride=2, padding=1)
        self.layer1 = self._make_layer(block, 64, blocks_num[0])
        self.layer2 = self._make_layer(block, 128, blocks_num[1], stride=2)
        self.layer3 = self._make_layer(block, 256, blocks_num[2], stride=2)
        self.layer4 = self._make_layer(block, 512, blocks_num[3], stride=2)
        if self.include_top:
            self.avgpool = nn.AdaptiveAvgPool2d((1, 1))  # output size = (1, 1)
            self.fc = nn.Linear(512 * block.expansion, num_classes)

        for m in self.modules():
            if isinstance(m, nn.Conv2d):
                nn.init.kaiming_normal_(m.weight, mode='fan_out', nonlinearity='relu')

    def _make_layer(self, block, channel, block_num, stride=1):
        downsample = None
        if stride != 1 or self.in_channel != channel * block.expansion:
            downsample = nn.Sequential(
                nn.Conv2d(self.in_channel, channel * block.expansion, kernel_size=1, stride=stride, bias=False),
                nn.BatchNorm2d(channel * block.expansion))

        layers = []
        layers.append(block(self.in_channel, channel, downsample=downsample, stride=stride))
        self.in_channel = channel * block.expansion

        for _ in range(1, block_num):
            layers.append(block(self.in_channel, channel))

        return nn.Sequential(*layers)

    def forward(self, x):
        x = self.conv1(x)
        x = self.bn1(x)
        x = self.relu(x)
        x = self.maxpool(x)

        x = self.layer1(x)
        x = self.layer2(x)
        x = self.layer3(x)
        x = self.layer4(x)

        if self.include_top:
            x = self.avgpool(x)
            x = torch.flatten(x, 1)
            x = self.fc(x)

        return x

if you don’t want to construct the network by yourself,the torchvision.models provides some pre-training models which will help you a lot.

from torchvision.models import resnet34
net = resnet34(pretrained=False)
inchannel = net.fc.in_features
net.fc = nn.Linear(inchannel, numclass)

Let’s see the recognition effect of the Resnet.
在这里插入图片描述

论文:https://arxiv.org/abs/1512.03385

Grant|zgh

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
Classification Model:Resnet

Since the convolutional neural network is applied to image classification, the structure of the network has continued to develop and optimize. From Lenet, AlexNet, VGGNet, to ResNet, the depth of the network is getting larger and larger.So here I want to
复制链接

扫一扫

Classification Model:Resnet

“相关推荐”对你有帮助么？