The relationship between map_location in torch.load and model.to

Latest recommended article published on 2023-11-04 08:00:00

思念殇千寻


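Tags: deep learning, python, pytorch, artificial intelligence, machine learning

Copyright notice: This is an original article by the author, released under the CC 4.0 BY-SA license. When reposting, please include a link to the original source and this notice.

Article link: https://blog.csdn.net/weixin_43590796/article/details/126842924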


Reference:

https://discuss.pytorch.org/t/is-map-location-in-torch-load-and-model-load-state-dict-independent-from-device-in-to/99983

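My question is the same as the one in the reference. torch.load takes a map_location parameter, which lets you load a checkpoint (or any saved object) onto a chosen device. However, if you then initialize a model and call model.load_state_dict, printing the model's device afterwards still shows cpu. This means we still need to follow up with model.to(XX).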
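The behaviour described above can be reproduced with a minimal sketch. It assumes a tiny `nn.Linear` stand-in model and an in-memory buffer instead of a real checkpoint file (neither comes from the original post), and it falls back to the CPU when no GPU is present, in which case all three devices simply read cpu.

```python
import io

import torch
import torch.nn as nn

# Use a GPU when one is available; otherwise fall back to the CPU so the sketch still runs.
target = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")

# Save a small state dict to an in-memory buffer (a stand-in for a checkpoint file).
buffer = io.BytesIO()
torch.save(nn.Linear(4, 2).state_dict(), buffer)
buffer.seek(0)

# map_location decides where the *loaded tensors* live.
state_dict = torch.load(buffer, map_location=target)
state_dict_device = state_dict["weight"].device

# A freshly constructed model lives on the CPU, and load_state_dict does not move it.
model = nn.Linear(4, 2)
model.load_state_dict(state_dict)
device_after_load = next(model.parameters()).device

# Only moving the model itself changes where its parameters live.
model.to(target)
device_after_to = next(model.parameters()).device

print(state_dict_device, device_after_load, device_after_to)
```

On a machine with a GPU, the second printed device is still cpu even though the state dict was loaded onto cuda:0.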

An expert in the referenced thread gave this explanation:

The map_location changes the device of the Tensors in the state dict that is returned.
But when you load_state_dict(), then these values are loaded (and only values) into the model. But that does not change the model’s device! you will need to move the model itself with .to() if you want to have it on a different device.

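In other words, what torch.load returns is just tensors. For example, if those tensors were saved from device("cuda:1"), then unless map_location says otherwise, PyTorch will still try to load them back onto that same GPU (cuda:1, which is the second card, since device indices start at 0). load_state_dict, however, does not care where your state_dict lives; it only loads the values. So if your model was initialized on the CPU, it simply copies the values of the tensors in the state_dict from the GPU into the CPU model's existing parameters, and the model's device is still cpu.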
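The "values only" behaviour can be checked directly. The sketch below assumes two small `nn.Linear` modules as stand-ins and the default arguments of `load_state_dict`; it verifies that the model keeps the same parameter object and memory after loading, with only the numbers inside changed.

```python
import torch
import torch.nn as nn

source = nn.Linear(3, 1)   # plays the role of the saved weights
model = nn.Linear(3, 1)    # the model we load into (created on the CPU)

weight_before = model.weight              # reference to the existing Parameter object
ptr_before = model.weight.data_ptr()      # address of its underlying memory
device_before = model.weight.device

model.load_state_dict(source.state_dict())

same_parameter = model.weight is weight_before          # the Parameter was not replaced
same_storage = model.weight.data_ptr() == ptr_before    # values were copied in place
same_device = model.weight.device == device_before      # so the device cannot have changed
values_match = torch.equal(model.weight, source.weight)

print(same_parameter, same_storage, same_device, values_match)
```

Because the existing parameters are overwritten in place, they stay wherever the model already was. Newer PyTorch releases also accept `assign=True` in `load_state_dict`, which behaves differently; this sketch covers only the default.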

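To sum up, loading weights does not move a model; only moving the model itself, with a call such as model.to(...), changes its device!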
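Putting the pieces together, one common pattern is to load the checkpoint onto the CPU and then move the model explicitly. The sketch below is only an illustration: `load_model_on` is a hypothetical helper, the `nn.Linear` architecture and the temporary `checkpoint.pt` file are placeholders, and the code falls back to the CPU when no GPU is available.

```python
import os
import tempfile

import torch
import torch.nn as nn


def load_model_on(path, device):
    """Hypothetical helper: build the model, load weights via the CPU, then move it."""
    model = nn.Linear(4, 2)  # stand-in architecture
    # Loading onto the CPU avoids depending on the GPU the checkpoint was saved from.
    state_dict = torch.load(path, map_location="cpu")
    model.load_state_dict(state_dict)  # copies values only
    return model.to(device)            # the step that actually changes the model's device


device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "checkpoint.pt")
    torch.save(nn.Linear(4, 2).state_dict(), path)
    model = load_model_on(path, device)

final_device = next(model.parameters()).device
print(final_device)
```

Loading with map_location set to the target device and then calling model.to(device) works as well; either way the explicit .to() call is what places the model.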
