Random numbers
import torch

torch.manual_seed(7)                    # fix the RNG seed for reproducible results
features = torch.randn((1, 5))          # 1 x 5 input tensor
weights = torch.randn_like(features)    # same shape as features
bias = torch.randn((1, 1))              # single bias term
Multiplication
Element-wise multiplication
Use the * operator directly
Matrix multiplication
# Strict about shapes (both arguments must be 2-D matrices); recommended
torch.mm(a, b)
# Less strict: broadcasts and also handles 1-D and batched inputs
torch.matmul(a, b)
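As a quick check, the two styles give the same single-neuron activation when applied to the features, weights and bias defined in the random-number section above (a minimal sketch; the (1, 5) shapes come from that section):

# Element-wise multiply, then sum over the 5 products
y1 = torch.sum(features * weights) + bias
# Matrix multiply: (1, 5) @ (5, 1) -> (1, 1); weights must be reshaped to a column first
y2 = torch.mm(features, weights.view(5, 1)) + bias
print(y1, y2)   # same value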
Reshaping tensors
# Returns a tensor with the new shape (a view when possible, otherwise a copy); the original tensor keeps its shape
.reshape(a, b)
# In-place resize; not strict: if the new size does not match, data may be dropped or uninitialized elements added
.resize_(a, b)
# Returns a view of the same data; strict (raises an error if the sizes do not match); only works on contiguous tensors; recommended
.view(a, b)
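A small sketch of the difference in strictness (x is just an illustrative tensor):

x = torch.arange(6)        # tensor([0, 1, 2, 3, 4, 5])
print(x.view(2, 3))        # ok: 2 * 3 == 6
print(x.reshape(3, 2))     # ok: returns a new tensor, x itself keeps shape (6,)
# x.view(2, 4)             # would raise an error: 2 * 4 != 6
x.resize_(2, 4)            # no error: silently adds uninitialized elements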
numpy to tensor
When converting between NumPy arrays and tensors (torch.from_numpy(a) / b.numpy()), the two objects share the same underlying memory: if you convert a NumPy array a to a tensor b, changing the values of b also changes a, and vice versa.
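A minimal sketch of this shared-memory behaviour (the names a and b are just illustrative):

import numpy as np

a = np.random.rand(4, 3)
b = torch.from_numpy(a)    # shares memory with a (torch.tensor(a) would copy instead)
b.mul_(2)                  # an in-place multiply on the tensor...
print(a)                   # ...shows up in the NumPy array as well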
Building a neural network
Version 1:
from torch import nn

class Network(nn.Module):
    def __init__(self):
        super().__init__()
        # Inputs to hidden layer linear transformation
        self.hidden = nn.Linear(784, 256)
        # Output layer, 10 units - one for each digit
        self.output = nn.Linear(256, 10)
        # Define sigmoid activation and softmax output
        self.sigmoid = nn.Sigmoid()
        self.softmax = nn.Softmax(dim=1)

    def forward(self, x):
        # Pass the input tensor through each of our operations
        x = self.hidden(x)
        x = self.sigmoid(x)
        x = self.output(x)
        x = self.softmax(x)
        return x
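With the class defined, the network can be instantiated and inspected (a minimal usage sketch):

model = Network()
print(model)                        # lists the hidden, output, sigmoid and softmax modules
print(model.hidden.weight.shape)    # torch.Size([256, 784])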
Version 2:
import torch
from torch import nn
import torch.nn.functional as F

class Network(nn.Module):
    def __init__(self):
        super().__init__()
        # Inputs to hidden layer linear transformation
        self.hidden = nn.Linear(784, 256)
        # Output layer, 10 units - one for each digit
        self.output = nn.Linear(256, 10)

    def forward(self, x):
        # Hidden layer with sigmoid activation (torch.sigmoid; F.sigmoid is deprecated)
        x = torch.sigmoid(self.hidden(x))
        # Output layer with softmax activation
        x = F.softmax(self.output(x), dim=1)
        return x
Version 3 (using nn.Sequential, strongly recommended):
# Hyperparameters for our network
input_size = 784
hidden_sizes = [128, 64]
output_size = 10

# Build a feed-forward network
model = nn.Sequential(nn.Linear(input_size, hidden_sizes[0]),
                      nn.ReLU(),
                      nn.Linear(hidden_sizes[0], hidden_sizes[1]),
                      nn.ReLU(),
                      nn.Linear(hidden_sizes[1], output_size),
                      nn.Softmax(dim=1))
print(model)
Version 4 (each layer can be given a name):
from collections import OrderedDict

model = nn.Sequential(OrderedDict([
    ('fc1', nn.Linear(input_size, hidden_sizes[0])),
    ('relu1', nn.ReLU()),
    ('fc2', nn.Linear(hidden_sizes[0], hidden_sizes[1])),
    ('relu2', nn.ReLU()),
    ('output', nn.Linear(hidden_sizes[1], output_size)),
    ('softmax', nn.Softmax(dim=1))]))
model
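One benefit of naming the layers is that they become addressable as attributes on the model (a small sketch):

print(model.fc1)                 # Linear(in_features=784, out_features=128, bias=True)
print(model.fc1.weight.shape)    # torch.Size([128, 784])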
Forward pass
# Prefer calling the model directly: this runs forward() plus any registered hooks
model(x)
# Equivalent to the above but bypasses hooks, so it is usually avoided
model.forward(x)
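For example, pushing a random batch through the Sequential model built above (the 784/10 sizes follow that section's hyperparameters; a minimal sketch):

x = torch.randn(64, 784)           # a batch of 64 flattened 28x28 images
ps = model(x)                      # shape (64, 10)
print(ps.shape, ps[0].sum())       # each softmax row sums to 1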
Backpropagation with a cross-entropy loss
Here the cross-entropy is split into a LogSoftmax layer at the end of the model plus an NLLLoss criterion; the combination is equivalent to applying nn.CrossEntropyLoss to raw logits.
# Build a feed-forward network
model = nn.Sequential(nn.Linear(784, 128),
                      nn.ReLU(),
                      nn.Linear(128, 64),
                      nn.ReLU(),
                      nn.Linear(64, 10),
                      nn.LogSoftmax(dim=1))

criterion = nn.NLLLoss()

images, labels = next(iter(trainloader))
images = images.view(images.shape[0], -1)   # flatten to (batch, 784)

logps = model(images)            # log-probabilities
loss = criterion(logps, labels)
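The heading mentions backpropagation, but the snippet stops at computing the loss; a minimal sketch of the backward step (gradients are written into each parameter's .grad attribute):

print(model[0].weight.grad)          # None before backward
loss.backward()                      # backpropagate through the whole network
print(model[0].weight.grad.shape)    # torch.Size([128, 784]) afterwards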
The complete MNIST training procedure
import torch
from torch import nn
import torch.nn.functional as F
from torchvision import datasets, transforms
from torch import optim

# Define a transform to normalize the data
transform = transforms.Compose([transforms.ToTensor(),
                                transforms.Normalize((0.5,), (0.5,)),
                                ])

# Download and load the training data
trainset = datasets.MNIST('~/.pytorch/MNIST_data/', download=True, train=True, transform=transform)
trainloader = torch.utils.data.DataLoader(trainset, batch_size=64, shuffle=True)

# Model: two hidden layers, log-softmax output
model = nn.Sequential(nn.Linear(784, 128),
                      nn.ReLU(),
                      nn.Linear(128, 64),
                      nn.ReLU(),
                      nn.Linear(64, 10),
                      nn.LogSoftmax(dim=1))

criterion = nn.NLLLoss()
optimizer = optim.SGD(model.parameters(), lr=0.003)

epochs = 10
for e in range(epochs):
    running_loss = 0
    for images, labels in trainloader:
        # Flatten MNIST images into a 784-long vector
        images = images.view(images.shape[0], -1)

        # Training pass: zero old gradients, forward, backward, update
        optimizer.zero_grad()
        loss = criterion(model(images), labels)
        loss.backward()
        optimizer.step()

        running_loss += loss.item()
    else:
        print(f"Training loss: {running_loss/len(trainloader)}")
%matplotlib inline
import helper   # course-provided helper module for plotting

images, labels = next(iter(trainloader))
img = images[0].view(1, 784)

# Turn off gradients to speed up this part
with torch.no_grad():
    logps = model(img)

# The output of the network is log-probabilities; take the exponential to get probabilities
ps = torch.exp(logps)
helper.view_classify(img.view(1, 28, 28), ps)