应该是变量的梯度没有关闭，导致在运行网络时，梯度也被保存在了 CUDA 显存里面。
解决方法是：在被调用的网络函数前面加一个装饰器（这个函数可以是专门在 eval 时使用的）：
@torch.no_grad()  # Gradients are never needed here (eval-only path); disabling tracking keeps activations out of CUDA memory.
def get_pos_density(self, positions):
    """Compute and return densities and base-MLP features at the given positions.

    Args:
        positions: Sample positions; presumably shaped (..., 3) — TODO confirm with caller.

    Returns:
        Tuple ``(density, base_mlp_out)``: the per-sample density (last MLP channel
        passed through ``trunc_exp``) and the remaining ``geo_feat_dim`` feature channels.

    Raises:
        AssertionError: If ``self.spatial_distortion`` is not set.
    """
    assert self.spatial_distortion is not None
    positions = self.spatial_distortion(positions)
    # Map spatially-distorted coordinates from [-2, 2] into [0, 1] for the base MLP.
    positions = (positions + 2.0) / 4.0
    self._sample_locations = positions
    # NOTE(review): under @torch.no_grad() this flag cannot make downstream ops record
    # a graph; kept only so the attribute matches the gradient-enabled variant of this
    # method — confirm nothing relies on grads of _sample_locations in the eval path.
    if not self._sample_locations.requires_grad:
        self._sample_locations.requires_grad = True
    positions_flat = positions.view(-1, 3)
    h = self.mlp_base(positions_flat)
    density_before_activation, base_mlp_out = torch.split(h, [1, self.geo_feat_dim], dim=-1)
    self._density_before_activation = density_before_activation
    # Rectifying the density with an exponential is much more stable than a ReLU or
    # softplus, because it enables high post-activation (float32) density outputs
    # from smaller internal (float16) parameters.
    density = trunc_exp(density_before_activation.to(positions))[:, 0]
    return density, base_mlp_out