Getting Started with PyTorch

In PyTorch, you cannot perform in-place operations on a tensor with requires_grad=True or convert it directly to numpy. To convert safely, detach it first and possibly clone it. This post covers switching between numpy and PyTorch in RL, as well as the training workflow: define the network, compute the loss, zero the gradients, backpropagate, and update the weights.

PyTorch tips (from Berkeley):

A few things to watch out for with variables:

  • You can’t do in-place operations on a leaf tensor that has requires_grad=True. (This prevents you from inadvertently mutating it in a way that isn’t tracked for backprop purposes.)

  • You also can’t convert a tensor with requires_grad=True to numpy (for the same reason as above). Instead, you need to detach it first, e.g. y.detach().numpy().

  • Even though y.detach() returns a new tensor, that tensor shares the same underlying memory as y. Unfortunately, PyTorch lets you make changes to y.detach() or y.detach().numpy() that will affect y as well! If you want to safely mutate the detached version, use y.detach().clone() instead, which creates a tensor in new memory. All three points are demonstrated in the sketch below.
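A minimal sketch of these three gotchas (the tensor values are placeholders):

```python
import torch

x = torch.ones(3, requires_grad=True)

# 1) In-place ops on a leaf tensor that requires grad raise a RuntimeError.
try:
    x.add_(1.0)
except RuntimeError as e:
    print(e)  # "a leaf Variable that requires grad is being used in an in-place operation"

# 2) Direct conversion to numpy raises too; detach first.
y = x * 2
try:
    y.numpy()
except RuntimeError as e:
    print(e)  # "Can't call numpy() on Tensor that requires grad..."
y_np = y.detach().numpy()  # OK

# 3) The detached view shares memory: mutating the numpy array mutates y.
y_np[0] = 100.0
print(y[0])  # tensor(100., ...) -- y changed as well!

y_safe = y.detach().clone().numpy()  # clone() gives an independent copy
y_safe[0] = -1.0
print(y[0])  # still 100. -- y is untouched
```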

RL Connection: Do simulator-related work in numpy, convert to torch for model-related work, then convert the output back to numpy to feed it into the simulator.
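One step of that round trip might look like the following sketch; the tiny linear policy and the env.step call are hypothetical placeholders:

```python
import numpy as np
import torch

policy = torch.nn.Linear(4, 2)                 # hypothetical tiny policy network
obs = np.random.randn(4).astype(np.float32)    # observation from the simulator (numpy)

obs_t = torch.from_numpy(obs)                  # numpy -> torch for the model step
scores = policy(obs_t)                         # model-related work stays in torch

scores_np = scores.detach().numpy()            # torch -> numpy (detach first!)
action = int(np.argmax(scores_np))
# env.step(action)                             # hypothetical: feed back into the simulator
```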

Tips for defining networks and computing gradients

  • Define a class for the neural network (a subclass of nn.Module)
  • Specify a loss function (MSE loss) and optimizer (Adam); make sure to pass all model parameters (especially for multimodal models)
  • Perform training by doing the following in a loop (a full sketch follows this list):
    • Make a prediction
    • Compute the loss
    • Zero the stored gradients
    • Backprop the loss with .backward()
    • Update the weights by taking a step of gradient descent
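Putting it together, a minimal sketch of that workflow (the architecture, data, and hyperparameters are placeholders):

```python
import torch
import torch.nn as nn

class Net(nn.Module):                     # 1) subclass nn.Module
    def __init__(self):
        super().__init__()
        self.fc1 = nn.Linear(10, 64)
        self.fc2 = nn.Linear(64, 1)

    def forward(self, x):
        return self.fc2(torch.relu(self.fc1(x)))

model = Net()
loss_fn = nn.MSELoss()                    # 2) loss function (MSE)
# Pass *all* parameters to the optimizer; with multiple modules, use
# itertools.chain(m1.parameters(), m2.parameters()) or an nn.ModuleList.
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

x = torch.randn(32, 10)                   # dummy inputs
y = torch.randn(32, 1)                    # dummy targets

for step in range(100):                   # 3) training loop
    pred = model(x)                       #    make a prediction
    loss = loss_fn(pred, y)               #    compute the loss
    optimizer.zero_grad()                 #    zero the stored gradients
    loss.backward()                       #    backprop the loss
    optimizer.step()                      #    update the weights
```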