深度学习之PyTorch实战(3)——实战手写数字识别 - 战争热诚 - 博客园
3D机器学习(2):pytorch入门:pytorch优点,梯度下降法初步_朱攀乖的博客-CSDN博客_pytorch的优点
PyTorch 资源链接
- 图像分类
- VGG
- ResNet
- DenseNet
- MobileNetV2
- ResNeXt
- SqueezeNet
- ShuffleNet
- ShuffleNet V2
- 位姿估计
- CPM: Convolutional Pose Machines
- OpenPose: Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields
- 物体检测
- SSD
- Faster R-CNN
- YoloV3
- FPN
- 语义分割
- DeepLabV3
- PSPNet
- DenseASPP
- 实例分割
- Mask R-CNN
和tensorflow的区别
1.相同点
- 开源
- 都有个强大的爹(Google/Facebook)
- 支持者众多,社区活跃
- 均支持python(显然,pytorch是python first)
2.不同点
- Tensorflow更倾向于工业应用,使用静态图计算,具有强大的移植性
- Pytorch作为后起之秀,受到了众多研究型学者的宠爱,语法简便,利用动态图计算,开发周期通常会比Tensorflow短一些。
Keras本身不是框架,而是一个位于其他DeepLearning框架之上的高级API。目前它支持TensorFlow,Theano和CNTK。Keras是迄今为止启动和运行最快最简单的框架。定义神经网络是直观的,使用功能性API允许人们将层定义为函数。
而PyTorch像Keras一样,它也抽象了深度网络编程的大部分混乱部分。PyTorch介于Keras和TensorFlow之间,比Keras拥有更灵活、更好的控制力,与此同时用户又不必做任何疯狂的声明式编程。
import torch
import torch.nn as nn
import torch.nn.functional as F
import torch.optim as optim
from torchvision import datasets, transforms
from torch.autograd import Variable
# Training settings
batch_size = 64
# MNIST Dataset
train_dataset = datasets.MNIST(root='./mnist_data/',
train=True,
transform=transforms.ToTensor(),
download=True)
test_dataset = datasets.MNIST(root='./mnist_data/',
train=False,
transform=transforms.ToTensor())
# Data Loader (Input Pipeline)
train_loader = torch.utils.data.DataLoader(dataset=train_dataset,
batch_size=batch_size,
shuffle=True)
test_loader = torch.utils.data.DataLoader(dataset=test_dataset,
batch_size=batch_size,
shuffle=False)
class Net(nn.Module):
def __init__(self):
super(Net, self).__init__()
self.l1 = nn.Linear(784, 520)
self.l2 = nn.Linear(520, 320)
self.l3 = nn.Linear(320, 240)
self.l4 = nn.Linear(240, 120)
self.l5 = nn.Linear(120, 10)
def forward(self, x):
# Flatten the data (n, 1, 28, 28) --> (n, 784)
x = x.view(-1, 784)
x = F.relu(self.l1(x))
x = F.relu(self.l2(x))
x = F.relu(self.l3(x))
x = F.relu(self.l4(x))
return F.log_softmax(self.l5(x), dim=1)
#return self.l5(x)
model = Net()
optimizer = optim.SGD(model.parameters(), lr=0.01, momentum=0.5)
def train(epoch):
# 每次输入barch_idx个数据
for batch_idx, (data, target) in enumerate(train_loader):
data, target = Variable(data), Variable(target)
optimizer.zero_grad()
output = model(data)
# loss
loss = F.nll_loss(output, target)
loss.backward()
# update
optimizer.step()
if batch_idx % 200 == 0:
print('Train Epoch: {} [{}/{} ({:.0f}%)]\tLoss: {:.6f}'.format(
epoch, batch_idx * len(data), len(train_loader.dataset),
100. * batch_idx / len(train_loader), loss.item()))
def test():
test_loss = 0
correct = 0
# 测试集
for data, target in test_loader:
data, target = Variable(data, volatile=True), Variable(target)
output = model(data)
# sum up batch loss
test_loss += F.nll_loss(output, target).item()
# get the index of the max
pred = output.data.max(1, keepdim=True)[1]
correct += pred.eq(target.data.view_as(pred)).cpu().sum()
test_loss /= len(test_loader.dataset)
print('\nTest set: Average loss: {:.4f}, Accuracy: {}/{} ({:.0f}%)\n'.format(
test_loss, correct, len(test_loader.dataset),
100. * correct / len(test_loader.dataset)))
for epoch in range(1,6):
train(epoch)
test()