I think this blogger's explanation is pretty good:
link
But here is the problem I'm currently running into:
import torch

n, d_in, H, d_out = 64, 1000, 100, 10  # batch size and layer sizes (example values)
x = torch.randn(n, d_in)
y = torch.randn(n, d_out)
w1 = torch.randn(d_in, H, requires_grad=True)
w2 = torch.randn(H, d_out, requires_grad=True)
lr = 1e-6
epoch = 500  # unused here; the loop below only runs 2 iterations to show the issue

for it in range(2):
    # forward pass
    y_pred = x.mm(w1).clamp(min=0).mm(w2)  # output shape: n * d_out

    # compute loss
    loss = (y_pred - y).pow(2).sum()

    # backward pass
    # 1. compute gradients
    loss.backward()
    print('Iteration {}: loss = {}, w1.requires_grad: {}, w2.requires_grad: {}'.format(
        it + 1, loss.item(), w1.requires_grad, w2.requires_grad))

    # 2. update the weights w1 and w2
    with torch.no_grad():
        w1 -= lr * w1.grad       # in-place update
        w2 = w2 - lr * w2.grad   # ordinary assignment (rebinds the name w2)
        print('w1.requires_grad:', w1.requires_grad)
        print('w2.requires_grad:', w2.requires_grad)
As you can see, when the update is done in place with '-=', w1 still requires grad after the update; but when the update is written as an ordinary subtraction and assignment, w2 no longer requires grad after the update.
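Here is a minimal sketch of my own (not from the blog post, with made-up tensor sizes) that isolates the two update styles outside of any training loop and shows the same behavior: the in-place op modifies the original leaf tensor, while the ordinary subtraction builds a brand-new tensor inside the no_grad block and rebinds the name to it.

import torch

a = torch.randn(3, requires_grad=True)
b = torch.randn(3, requires_grad=True)
grad = torch.ones(3)  # stand-in for a real gradient

with torch.no_grad():
    id_a_before, id_b_before = id(a), id(b)
    a -= 0.1 * grad      # in-place: the same tensor object is modified
    b = b - 0.1 * grad   # out-of-place: the name b now points to a new tensor

print(id(a) == id_a_before, a.requires_grad)  # True True
print(id(b) == id_b_before, b.requires_grad)  # False False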
So:
Question 1: why is there this difference between the two update styles?
Question 2: since both updates happen inside with torch.no_grad(), why does w1 still require grad?