pytorch讲解（部分）

猛码Memmat

已于 2023-09-25 23:03:38 修改

阅读量625

点赞数 2

分类专栏： library / tool 文章标签： pytorch python 深度学习

于 2023-05-23 11:24:41 首次发布

本文链接：https://blog.csdn.net/JishuFengyang/article/details/130822880

版权

友爱的目录

自动求导机制
CUDA语义
- 最佳实践
- - 使用固定的内存缓冲区
  - 使用 nn.DataParallel 替代 multiprocessing
扩展PyTorch
- 扩展 torch.autograd
- 扩展 torch.nn
多进程最佳实践
序列化语义
PACKAGE参考
TorchVision参考
其他：timm（PyTorch Image Model）库
参考文献

自动求导机制

了解这些并不是绝对必要的，但我们建议您熟悉它，因为它将帮助您编写更高效，更简洁的程序，并可帮助您进行调试。

从后向中排除子图

每个变量都有两个标志：requires_grad和volatile。它们都允许从梯度计算中精细地排除子图，并可以提高效率。

>>> x = Variable(torch.randn(5, 5))
>>> y = Variable(torch.randn(5, 5))
>>> z = Variable(torch.randn(5, 5), requires_grad=True)
>>> a = x + y
>>> a.requires_grad
False
>>> b = a + z
>>> b.requires_grad
True

model = torchvision.models.resnet18(pretrained=True)
for param in model.parameters():
    param.requires_grad = False
# Replace the last fully-connected layer
# Parameters of newly constructed modules have requires_grad=True by default
model.fc = nn.Linear(512, 100)

# Optimize only the classifier
optimizer = optim.SGD(model.fc.parameters(), lr=1e-2, momentum=0.9)

>>> regular_input = Variable(torch.randn(5, 5))
>>> volatile_input = Variable(torch.randn(5, 5), volatile=True)
>>> model = torchvision.models.resnet18(pretrained=True)
>>> model(regular_input).requires_grad
True
>>> model(volatile_input).requi