Implementing LeNet, and the problems encountered along the way

First, the implementation of the LeNet network class:

from torchvision import datasets, transforms
import torch
from torch.utils.data import DataLoader
import torch.nn as nn

class LeNet(nn.Module):
    def __init__(self):
        super(LeNet, self).__init__()
        self.conv_layers = nn.Sequential(
            # input: [b, 3, 32, 32], a batch of RGB images (3 channels)
            nn.Conv2d(in_channels=3, out_channels=6, kernel_size=5, stride=1, padding=0),
            # conv: [b, 3, 32, 32] -> [b, 6, 28, 28]
            nn.AvgPool2d(kernel_size=2, stride=2, padding=0),
            # pooling: [b, 6, 28, 28] -> [b, 6, 14, 14]
            nn.Conv2d(6, 16, kernel_size=5, stride=1, padding=0),
            # conv: [b, 6, 14, 14] -> [b, 16, 10, 10]
            nn.AvgPool2d(kernel_size=2, stride=2, padding=0)
            # pooling: [b, 16, 10, 10] -> [b, 16, 5, 5]
        )
        # flatten: [b, 16, 5, 5] -> [b, 16*5*5]
        # fully connected layers
        self.fc_layers = nn.Sequential(
            nn.Linear(in_features=16*5*5, out_features=120),
            nn.ReLU(),
            nn.Linear(120, 84),
            nn.ReLU(),
            nn.Linear(84, 10)
        )

    def forward(self, x):
        batchsz = x.size(0)
        x = self.conv_layers(x)
        # flatten: [b, 16, 5, 5] -> [b, 16*5*5]
        x = x.reshape(batchsz, 16*5*5)
        logit = self.fc_layers(x)
        return logit

Problems encountered along the way:

  1. Forgot to write super(LeNet, self).__init__(). Cause: I didn't understand what super does. super(LeNet, self) returns a proxy object bound to the current instance that delegates lookups to the parent class nn.Module; calling __init__() on it runs nn.Module's initializer on self, setting up the internal registries for parameters and submodules that must exist before any layer can be assigned (see the sketch after this list).
  2. Couldn't work out how the last pooling layer connects to the FC layer. Cause: I didn't realize that the batch dimension sits apart from the network's other hyperparameters: the layer parameters are all defined with respect to a single 32x32 image, and the batch size is just how many images are trained on at once. The pooling output of 16 channels, each 5x5, connects to the 120-unit FC layer by flattening [16, 5, 5] to [16*5*5] = 400, or with the batch dimension included, [batchsize, 16, 5, 5] -> [batchsize, 400].
  3. Use x.size(0) to get the length of a tensor's first dimension (here, the batch size).
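To make these points concrete, here is a minimal sketch (the Broken class and the random batch are made up for illustration) showing the super() failure mode and the shape flow:

import torch
import torch.nn as nn
from lenet import LeNet

# Point 1: assigning a submodule before nn.Module.__init__ has run fails,
# because the module/parameter registries do not exist yet.
class Broken(nn.Module):
    def __init__(self):
        # super().__init__() deliberately omitted
        self.fc = nn.Linear(4, 2)

try:
    Broken()
except AttributeError as e:
    print(e)  # complains about assigning a module before Module.__init__() call

# Points 2 and 3: trace the shapes with a dummy batch.
model = LeNet()
x = torch.randn(8, 3, 32, 32)                 # 8 fake 32x32 RGB images
feat = model.conv_layers(x)
print(feat.shape)                             # torch.Size([8, 16, 5, 5])
print(feat.size(0))                           # 8, the batch dimension
print(feat.reshape(feat.size(0), -1).shape)   # torch.Size([8, 400]), 400 = 16*5*5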

Next, the training code:

from torchvision import datasets, transforms
import torch
from torch.utils.data import DataLoader
import torch.nn as nn
import torch.nn.functional as F
from lenet import LeNet

batchsz = 32
cifar_train = datasets.CIFAR10('../cifar', train=True, transform=transforms.Compose([
    transforms.Resize((32, 32)),
    transforms.ToTensor()
]), download=True)
cifar_train = DataLoader(cifar_train, batch_size=batchsz, shuffle=True)
cifar_test = datasets.CIFAR10('../cifar', train=False, transform=transforms.Compose([
    transforms.Resize((32, 32)),
    transforms.ToTensor()
]), download=True)
cifar_test = DataLoader(cifar_test, batch_size=batchsz, shuffle=True)
model = LeNet()
# device = torch.device('cuda')
# model = LeNet().to(device)
criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
print(model)
for epoch in range(10):
    # switch back to train mode each epoch (model.eval() below would otherwise persist)
    model.train()
    for batchidx, (x, label) in enumerate(cifar_train):
        # x: [b, 3, 32, 32], label: [b]
        # x, label = x.to(device), label.to(device)
        # note: CrossEntropyLoss() includes softmax, so the input must be logits, not probabilities
        logits = model(x)
        # logits: [b, 10], label: [b]
        loss = criterion(logits, label)
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
    # loss of the last batch in this epoch
    print("epoch", epoch, "loss:", loss.item())
    # test
    model.eval()
    correct_num = 0
    total_num = 0
    with torch.no_grad():
        for (x, label) in cifar_test:
            # x: [b, 3, 32, 32], label: [b]
            logits = model(x)
            # logits: [b, 10]
            pred = torch.argmax(logits, dim=1)
            correct_num += torch.eq(pred, label).float().sum().item()
            total_num += x.size(0)
        print('test accuracy:', correct_num / total_num)

Problems that came up:

  1. I first wrote the import as import lenet, in which case the code below must say model = lenet.LeNet(). With from lenet import LeNet, you can write model = LeNet() directly.
  2. Mistakenly wrote loss = criterion(label, logits). The argument order of CrossEntropyLoss() matters: the prediction (logits) comes first, the target second.
  3. In the test loop, mistakenly wrote with model.zero_grad() instead of with torch.no_grad(). zero_grad() clears accumulated gradients; no_grad() disables gradient tracking, which is what evaluation needs (see the sketch after this list).
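A minimal sketch of points 2 and 3 (the random tensors here stand in for a real batch):

import torch
import torch.nn as nn

criterion = nn.CrossEntropyLoss()       # applies log-softmax internally, so pass raw logits
logits = torch.randn(4, 10)             # [b, 10] raw scores from the model
label = torch.randint(0, 10, (4,))      # [b] integer class indices
loss = criterion(logits, label)         # (input, target) order; criterion(label, logits) is wrong

with torch.no_grad():                   # disables autograd tracking for evaluation
    pred = torch.argmax(logits, dim=1)  # [b] predicted class per sample
    acc = torch.eq(pred, label).float().mean().item()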

Finally, the run output:

D:\Anaconda\envs\pytorch\python.exe "D:/PycharmProjects/pythonProjecet_pytorch/LeNet & ResNet/LeNet/train.py"
Files already downloaded and verified
Files already downloaded and verified
LeNet(
  (conv_layers): Sequential(
    (0): Conv2d(3, 6, kernel_size=(5, 5), stride=(1, 1))
    (1): AvgPool2d(kernel_size=2, stride=2, padding=0)
    (2): Conv2d(6, 16, kernel_size=(5, 5), stride=(1, 1))
    (3): AvgPool2d(kernel_size=2, stride=2, padding=0)
  )
  (fc_layers): Sequential(
    (0): Linear(in_features=400, out_features=120, bias=True)
    (1): ReLU()
    (2): Linear(in_features=120, out_features=84, bias=True)
    (3): ReLU()
    (4): Linear(in_features=84, out_features=10, bias=True)
  )
)
epoch 0 loss: 1.37169349193573
test accuracy: 0.4542
epoch 1 loss: 1.9433141946792603
test accuracy: 0.4866
epoch 2 loss: 2.1775083541870117
test accuracy: 0.5228
epoch 3 loss: 1.9912444353103638
test accuracy: 0.5261
epoch 4 loss: 1.410996913909912
test accuracy: 0.5381
epoch 5 loss: 0.9067156314849854
test accuracy: 0.5481
epoch 6 loss: 1.1418999433517456
test accuracy: 0.53
epoch 7 loss: 1.032881498336792
test accuracy: 0.5518
epoch 8 loss: 1.4319336414337158
test accuracy: 0.5499
epoch 9 loss: 0.9463621377944946
test accuracy: 0.5431

Process finished with exit code 0
