使用PyTorch实现CNN
文章目录
1. 导入所需包:
import torch
from torch.utils import data # 获取迭代数据
from torch.autograd import Variable # 获取变量
import torchvision
from torchvision.datasets import mnist # 获取数据集
import matplotlib.pyplot as plt
2. 获取数据集
2.1 获取数据集,并对数据集进行预处理
(1)对原有数据转成Tensor类型
(2)用平均值和标准偏差归一化张量图像
# 数据集的预处理
data_tf = torchvision.transforms.Compose(
[
torchvision.transforms.ToTensor(),
torchvision.transforms.Normalize([0.5],[0.5])
]
)
data_path = r'C:\Users\liev\Desktop\myproject\yin_test\MNIST_DATA_PyTorch'
# 获取数据集
train_data = mnist.MNIST(data_path,train=True,transform=data_tf,download=False)
test_data = mnist.MNIST(data_path,train=False,transform=data_tf,download=False)
第一次下载的输出:
Downloading http://yann.lecun.com/exdb/mnist/train-images-idx3-ubyte.gz
Downloading http://yann.lecun.com/exdb/mnist/train-labels-idx1-ubyte.gz
Downloading http://yann.lecun.com/exdb/mnist/t10k-images-idx3-ubyte.gz
Downloading http://yann.lecun.com/exdb/mnist/t10k-labels-idx1-ubyte.gz
Processing...
Done!
注意:
- 对数据的预处理还有很多。
- 第一次获取数据集时,参数download=True,会下载MNIST数据集所有文件,包括训练集和测试集
- 获取MNIST数据集的步骤:
-
如果本地没有数据集:
-
train_data = mnist.MNIST(data_path,train=True,transform=data_tf,download=True)
-
等待下载,直到下载完成
-
train_data = mnist.MNIST(data_path,train=True,transform=data_tf,download=False) test_data = mnist.MNIST(data_path,train=False,transform=data_tf,download=False)
-
获取测试集和训练集
-
-
如果本地有数据集
-
train_data = mnist.MNIST(data_path,train=True,transform=data_tf,download=False) test_data = mnist.MNIST(data_path,train=False,transform=data_tf,download=False)
-
2.2 获取迭代数据:data.DataLoader()
train_loader = data.DataLoader(train_data,batch_size=128,shuffle=True)
test_loader = data.DataLoader(test_data,batch_size=100,shuffle=True)
注意:
- DataLoader返回的是所有的数据,只是分成了每批次为参数batch_size的数据
- DataLoader的shuffle参数,True 决定了是否能多次取出batch_size,False,则表明只能取出数据集大小的数据。
3. 定义网络结构
CNN网络结构 | 输入shape | 卷积核 | 激活函数 | 输出图像 |
---|---|---|---|---|
conv1 | [128,1,28,28] | [3,3,1,16] | ReLU | [128, 16, 14, 14] |
conv2 | [128, 16, 14, 14] | [3,3,16,32] | ReLU | [128, 32, 7, 7] |
conv3 | [128, 32, 7, 7] | [3,3,32,64] | ReLU | [128, 64, 4, 4] |
conv4 | [128, 64, 4, 4] | [3,3,64,64] | ReLU | [128, 64, 2, 2] |
代码实现:
# 定义网络结构
class CNNnet(torch.nn.Module):
def __init__(self):
super(CNNnet,self).__init__()
self.conv1 = torch.nn.Sequential(
torch.nn.Conv2d(in_channels=1,
out_channels=16,
kernel_size=3,
stride=2,
padding=1),
torch.nn.BatchNorm2d(16),
torch.nn.ReLU()
)
self.conv2 = torch.nn.Sequential(
torch.nn.Conv2d(16,32,3,2,1),
torch.nn.BatchNorm2d(32),
torch.nn.ReLU()
)
self.conv3 = torch.nn.Sequential(
torch.nn.Conv2d(32,64,3,2,1),
torch.nn.BatchNorm2d(64),
torch.nn.ReLU()
)
self.conv4 = torch.nn.Sequential(
torch.nn.Conv2d(64,64,2,2,0),
torch.nn.BatchNorm2d(64),
torch.nn.ReLU()
)
self.mlp1 = torch.nn.Linear(2*2*64,100)
self.mlp2 = torch.nn.Linear(100,10)
def forward(self, x):
x = self.conv1(x)
x = self.conv2(x)
x = self.conv3(x)
x = self.conv4(x)
x = self.mlp1(x.view(x.size(0),-1))
x = self.mlp2(x)
return x
model = CNNnet()
print(model)
输出:
CNNnet(
(conv1): Sequential(
(0): Conv2d(1, 16, kernel_size=(3, 3), stride=(2, 2), padding=(1, 1))
(1): BatchNorm2d(16, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(2): ReLU()
)
(conv2): Sequential(
(0): Conv2d(16, 32, kernel_size=(3, 3), stride=(2, 2), padding=(1, 1))
(1): BatchNorm2d(32, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
(2): ReLU()
)