#### 1. Saving and Loading Model Parameters
(A) Saving
# 2 ways to save the net
torch.save(net1, 'net.pkl') # save entire net
torch.save(net1.state_dict(), 'net_params.pkl') # save only the parameters
(B) Loading
# copy net1's parameters into net3
net3.load_state_dict(torch.load('net_params.pkl'))
prediction = net3(x)
Both net1 and net3 above are instances of nn.Module. Note that load_state_dict only copies parameters, so net3 must already be built with the same architecture as net1.
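The snippets above can be put together into a runnable sketch. The network definitions here are stand-ins (the original text does not show how net1 and net3 were built); the only requirement is that both share the same architecture:

```python
import os
import tempfile

import torch
import torch.nn as nn

# Stand-in network; any nn.Module works the same way
net1 = nn.Sequential(nn.Linear(1, 10), nn.ReLU(), nn.Linear(10, 1))
x = torch.randn(5, 1)

path = os.path.join(tempfile.mkdtemp(), 'net_params.pkl')
torch.save(net1.state_dict(), path)  # save only the parameters

# Rebuild the same architecture, then copy net1's parameters into net3
net3 = nn.Sequential(nn.Linear(1, 10), nn.ReLU(), nn.Linear(10, 1))
net3.load_state_dict(torch.load(path))

# With identical parameters, the two nets give identical predictions
assert torch.equal(net1(x), net3(x))
```

Saving only the state_dict is the lighter option: the file holds just tensors, whereas `torch.save(net1, ...)` pickles the whole module and ties the file to the class definition.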
#### 2. Clamping Model Parameters
# Clip weights of discriminator
for p in discriminator.parameters():
    p.data.clamp_(-opt.clip_value, opt.clip_value)
Here p iterates over the parameters of discriminator, an nn.Module; the snippet comes from a WGAN implementation. Clamping is not limited to WGAN: it can also suppress NaNs that appear during training, but choosing the clamp range is critical.
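A self-contained sketch of the same clipping step; the discriminator here is a placeholder, and clip_value = 0.01 is the value used in the original WGAN paper, not one taken from this article:

```python
import torch
import torch.nn as nn

clip_value = 0.01  # WGAN's default; tune per model

# Placeholder discriminator for illustration
discriminator = nn.Sequential(nn.Linear(4, 8), nn.ReLU(), nn.Linear(8, 1))

# In-place clamp of every parameter tensor into [-clip_value, clip_value]
for p in discriminator.parameters():
    p.data.clamp_(-clip_value, clip_value)

# After clipping, no parameter lies outside the range
for p in discriminator.parameters():
    assert float(p.data.abs().max()) <= clip_value
```

In a real training loop this runs after every optimizer step on the discriminator, so the weights stay inside the range throughout training.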
#### 3. Moving Models to CUDA
When training with CUDA, both the model and the data must be moved onto the GPU. PyTorch tensors come in two variants; taking Float as an example: torch.FloatTensor for the CPU and torch.cuda.FloatTensor for CUDA. The full list:
| n | CPU | CUDA | Description |
|---|-----|------|-------------|
| 1 | torch.FloatTensor | torch.cuda.FloatTensor | 32-bit floating point |
| 2 | torch.DoubleTensor | torch.cuda.DoubleTensor | 64-bit floating point |
| 3 | N/A | torch.cuda.HalfTensor | 16-bit floating point |
| 4 | torch.ByteTensor | torch.cuda.ByteTensor | 8-bit integer (unsigned) |
| 5 | torch.CharTensor | torch.cuda.CharTensor | 8-bit integer (signed) |
| 6 | torch.ShortTensor | torch.cuda.ShortTensor | 16-bit integer (signed) |
| 7 | torch.IntTensor | torch.cuda.IntTensor | 32-bit integer (signed) |
| 8 | torch.LongTensor | torch.cuda.LongTensor | 64-bit integer (signed) |
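A minimal sketch of moving a model and its data to CUDA; it falls back to the CPU when no GPU is available, so the snippet runs anywhere. The network and shapes are illustrative only:

```python
import torch
import torch.nn as nn

# Pick the GPU when available, otherwise stay on the CPU
device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')

net = nn.Linear(3, 2).to(device)  # moves all parameters to the device
x = torch.randn(4, 3).to(device)  # inputs must live on the same device

y = net(x)

# tensor.type() reports the CPU or CUDA variant from the table above
assert x.type() in ('torch.FloatTensor', 'torch.cuda.FloatTensor')
assert y.device.type == device.type
```

Mixing devices (e.g. a CUDA model fed a CPU tensor) raises a RuntimeError, so in practice every tensor created inside the training loop gets the same `.to(device)` treatment.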