PyTorch Multi-GPU Computation and Sync BatchNorm

Wanderer001

Last modified: 2023-11-02 20:27:04

Category: Pytorch. Tags: computer vision, deep learning, machine learning

First published: 2022-05-18 10:05:19

Article link: https://blog.csdn.net/weixin_36670529/article/details/104349669

Reference: "PyTorch Multi-GPU Computation and Sync BatchNorm", Tencent Cloud Community (云+社区, 腾讯云)

nn.DataParallel

Using a GPU in PyTorch is simple and convenient:

import torch
import torch.nn as nn

input_size = 5
output_size = 2

class Model(nn.Module):

    def __init__(self, input_size, output_size):
        super(Model, self).__init__()
        self.fc = nn.Linear(input_size, output_size)

    def forward(self, input):
        output = self.fc(input)
        print("[In Model]: device",torch.cuda.current_device() ," input size", input.size()," output size", output.size())
        return output


device = torch.device('cuda:0')

model = Model(input_size, output_size)
model.to(device)

x = torch.randn(2, 5)  # random input; torch.Tensor(2, 5) would hold uninitialized memory
x = x.to(device)
y = model(x)

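Note that calling Tensor.to() only returns a new copy on the GPU and does not change the original reference, so the result has to be assigned back, as in x = x.to(device). nn.Module.to() behaves differently: it moves the module's parameters in place, which is why model.to(device) above works without reassignment.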
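
The difference between the two `.to()` calls can be checked directly. The following is a minimal sketch, not part of the original post: the variable names are illustrative, and on a machine without CUDA it falls back to a dtype conversion, on the assumption that the same "returns a new tensor" behavior is enough to make the point.

```python
import torch
import torch.nn as nn

# Use the GPU when one is present; otherwise fall back to a dtype change
# (an assumption made so the sketch also runs on a CPU-only machine).
target = torch.device("cuda:0") if torch.cuda.is_available() else torch.float64

x = torch.randn(2, 5)
moved = x.to(target)  # a new tensor; x itself is left untouched
tensor_unchanged = x.device.type == "cpu" and x.dtype == torch.float32

model = nn.Linear(5, 2)
returned = model.to(target)  # parameters are converted in place
module_is_same_object = returned is model

param = next(model.parameters())
print("x still on:", x.device, x.dtype)
print("moved is on:", moved.device, moved.dtype)
print("model params on:", param.device, param.dtype)
```

Here `moved` is a different object from `x`, while `returned` is the very same object as `model`, so only the tensor needs the reassignment.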

The above covers only a single GPU. For multiple GPUs, PyTorch also provides a ready-made wrapper, DataParallel. You only need to put the model object into this container:

model = Model(input_size, output_size)

print("Let's use", torch.cuda.device_count(), "GPUs!\n")
model = nn.DataParallel(model)
model.to(device)
print(model)


# output
# Let's use 2 GPUs!

DataParallel(
  (module): Model(
    (fc): Linear(in_features=5, out_features=2, bias=True)
  )
)

The printed model is now wrapped in an extra DataParallel layer, but nothing in this output shows that multiple GPUs are present.
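
One way to see what the wrapper holds is to inspect its attributes rather than its printed form. The sketch below is an illustration under stated assumptions: it redefines a small `Model` like the one above (without the print statement), and it falls back to the CPU when CUDA is unavailable, in which case `device_ids` is empty and the wrapper simply calls the inner module.

```python
import torch
import torch.nn as nn

class Model(nn.Module):
    def __init__(self, input_size, output_size):
        super().__init__()
        self.fc = nn.Linear(input_size, output_size)

    def forward(self, x):
        return self.fc(x)

device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")
model = Model(5, 2)
wrapped = nn.DataParallel(model).to(device)

# device_ids lists the GPUs the wrapper will replicate the model onto
# (an empty list on a machine without CUDA).
print("device_ids:", wrapped.device_ids)

# The original model is still reachable through .module ...
same_module = wrapped.module is model
# ... and parameter names in the state dict gain a "module." prefix.
keys = list(wrapped.state_dict().keys())
print(same_module, keys)

# The batch is split along dim 0 across GPUs and the outputs are gathered.
y = wrapped(torch.randn(4, 5).to(device))
out_shape = tuple(y.shape)
print(out_shape)
```

The `module.` prefix matters when saving checkpoints: a state dict saved from the wrapped model has different key names than one saved from the bare `Model`.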

Next, construct a Dummy DataSet&
