Pytorch学习笔记（深度之眼）（4）之模型容器

最新推荐文章于 2023-12-19 23:18:37 发布

liuyu进阶

最新推荐文章于 2023-12-19 23:18:37 发布

阅读量152

点赞数

分类专栏：深度学习 python 笔记文章标签： python 深度学习神经网络

本文链接：https://blog.csdn.net/m0_45866718/article/details/111819500

版权

笔记同时被 3 个专栏收录

45 篇文章 2 订阅

订阅专栏

深度学习

21 篇文章 2 订阅

订阅专栏

python

17 篇文章 1 订阅

订阅专栏

1.容器.Containers

在这里插入图片描述 nn.Sequential是nn.module的容器，用于按顺序包装一组网络层。

code

import torch.nn as nn


class LeNetSequential(nn.Module):
    def __init__(self, classes):
        super(LeNetSequential, self).__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 6, 5),
            nn.ReLU(),
            nn.MaxPool2d(kernel_size=2, stride=2),
            nn.Conv2d(6, 16, 5),
            nn.ReLU(),
            nn.MaxPool2d(kernel_size=2, stride=2),)

        self.classifier = nn.Sequential(
            nn.Linear(16*5*5, 120),
            nn.ReLU(),
            nn.Linear(120, 84),
            nn.ReLU(),
            nn.Linear(84, classes),)

    def forward(self, x):
        x = self.features(x)
        x = x.view(x.size()[0], -1)  # 展开，形状变换
        x = self.classifier(x)
        return x

在__init__()模块中，采用sequential()对卷积层和池化层进行包装，把sequential类属性赋予feature，然后对三个全连接层进行sequential包装，赋值为classifier类属性，这就完成了模型构建的第一步。foward构建了前向传播过程，只有三行，非常简洁。

我们用sequential构建LeNet，LeNet中有一个features，类型为sequential，sequential中有六个网络层，以序号(0)-(5)命名；还有一个classifier，一样是sequential。这里存在一个问题，这里的网络层是没有名字的，是通过序号索引的，如果在一个上千层的网络中，很难采用序号去进行索引每一个网络层。这时候可以对网络层进行命名，这就是第二种sequential的方法，对sequential输入一个头有序的字典，以这种方式构建网络，代码如下所示。

class LeNetSequentialOrderDict(nn.Module):
    def __init__(self, classes):
        super(LeNetSequentialOrderDict, self).__init__()

        self.features = nn.Sequential(OrderedDict({
            'conv1': nn.Conv2d(3, 6, 5),
            'relu1': nn.ReLU(inplace=True),
            'pool1': nn.MaxPool2d(kernel_size=2, stride=2),

            'conv2': nn.Conv2d(6, 16, 5),
            'relu2': nn.ReLU(inplace=True),
            'pool2': nn.MaxPool2d(kernel_size=2, stride=2),
        }))

        self.classifier = nn.Sequential(OrderedDict({
            'fc1': nn.Linear(16*5*5, 120),
            'relu3': nn.ReLU(),

            'fc2': nn.Linear(120, 84),
            'relu4': nn.ReLU(inplace=True),

            'fc3': nn.Linear(84, classes),
        }))

    def forward(self, x):
        x = self.features(x)
        x = x.view(x.size()[0], -1)
        x = self.classifier(x)
        return x

总结

nn.Sequential是nn.module的容器，用于按顺序包装一组网络层：

顺序性：各网络层之间严格按照顺序构建；
自带forward()：自带的forward里，通过for循环依次执行前向传播运算；

2.容器.ModuleList

![在这里插入图片描述](https://img-blog.csdnimg.cn/20201227173401577.png?x-oss-process=image/watermark,type_ZmFuZ3poZW5naGVpdGk,shadow_10,text_aHR0cHM6Ly9ibG9nLmNzZG4ubmV0L20wXzQ1ODY2NzE4,size_16,color_FFFFFF,t_70

class ModuleList(nn.Module):
    def __init__(self):
        super(ModuleList, self).__init__()
        self.linears = nn.ModuleList([nn.Linear(10, 10) for i in range(20)])  # 列表生成式，采用for循环

    def forward(self, x):
        for i, linear in enumerate(self.linears):
            x = linear(x)
        return x
        
net = ModuleList()

通过单步调试，进入了nn.ModuleList的init()函数当中，通过代码可以发现，如果mpdules不是空的话，则会不断进行叠加，得到linear。

可以看到，通过nn.ModuleList可以简便地建立一个二十层的全连接网络模型。

3.容器.moduleLDict

在这里插入图片描述

class ModuleDict(nn.Module):
    def __init__(self):
        super(ModuleDict, self).__init__()
        self.choices = nn.ModuleDict({
            'conv': nn.Conv2d(10, 10, 3),
            'pool': nn.MaxPool2d(3)
        })

        self.activations = nn.ModuleDict({
            'relu': nn.ReLU(),
            'prelu': nn.PReLU()
        })

    def forward(self, x, choice, act):
        x = self.choices[choice](x)
        x = self.activations[act](x)
        return x

net = ModuleDict()

fake_img = torch.randn((4, 10, 32, 32))

output = net(fake_img, 'conv', 'relu')