[torch] The difference between rsample and sample

qq_42725437

Category: torch. Tags: deep learning, pytorch, artificial intelligence

Article link: https://blog.csdn.net/qq_42725437/article/details/134979212

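sample(): draws a random sample from the probability distribution. The sampling step is not recorded as a differentiable function of the distribution's parameters, so we cannot backpropagate through it (the computation graph is cut off).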

See the source code of sample in torch.distributions.normal.Normal:

def sample(self, sample_shape=torch.Size()):
    shape = self._extended_shape(sample_shape)
    with torch.no_grad():
        return torch.normal(self.loc.expand(shape), self.scale.expand(shape))

torch.normal returns a tensor of random numbers. In addition, the torch.no_grad() context prevents the operation from being added to the computation graph.

As you can see, we cannot backpropagate. The tensor returned by sample() contains only numbers, with no computation graph attached.
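To make this concrete, here is a minimal sketch that checks the tensor returned by sample(). The parameter values and the variable names loc, scale, dist, and z are illustrative assumptions, not part of the original post.

```python
import torch
from torch.distributions import Normal

# Learnable distribution parameters (illustrative values).
loc = torch.tensor(0.0, requires_grad=True)
scale = torch.tensor(1.0, requires_grad=True)

dist = Normal(loc, scale)
z = dist.sample()

# The sample is a plain tensor with no link back to loc or scale.
print(z.requires_grad)  # False
print(z.grad_fn)        # None
```

Because z has no grad_fn, calling z.backward() here would raise an error instead of filling in loc.grad and scale.grad.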

So, what is rsample()?

With rsample, we can backpropagate, because it keeps the computation graph alive.

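How? By moving the randomness into a separate variable that does not depend on the distribution's parameters. This is called the "reparameterization trick".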

rsample: samples using the reparameterization trick.
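For the normal distribution, the trick can be written out as follows. The symbols are notation assumed here for illustration: \mu stands for loc, \sigma for scale, and \varepsilon for the separately drawn noise.

```latex
\varepsilon \sim \mathcal{N}(0, 1), \qquad z = \mu + \sigma \, \varepsilon
\quad\Longrightarrow\quad
\frac{\partial z}{\partial \mu} = 1, \qquad
\frac{\partial z}{\partial \sigma} = \varepsilon
```

The sample z is still distributed as \mathcal{N}(\mu, \sigma^2), but it is now a deterministic, differentiable function of \mu and \sigma once \varepsilon is fixed.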

In the source code, the randomness lives in eps:

def rsample(self, sample_shape=torch.Size()):
    shape = self._extended_shape(sample_shape)
    eps = _standard_normal(shape, dtype=self.loc.dtype, device=self.loc.device)
    return self.loc + eps * self.scale
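As a quick check of the behavior described above, the following sketch backpropagates through a single rsample() draw. The parameter values and variable names are illustrative assumptions; eps is recovered from the sample only for the comparison.

```python
import torch
from torch.distributions import Normal

# Learnable distribution parameters (illustrative values).
loc = torch.tensor(0.5, requires_grad=True)
scale = torch.tensor(2.0, requires_grad=True)

dist = Normal(loc, scale)
z = dist.rsample()   # z = loc + eps * scale, still attached to the graph
z.backward()

# Recover the noise that was drawn, to compare against the gradient.
eps = (z.detach() - loc.detach()) / scale.detach()

print(z.requires_grad)   # True
print(loc.grad)          # dz/dloc = 1
print(scale.grad, eps)   # dz/dscale equals eps
```

Since z = loc + eps * scale, the gradient with respect to loc is 1 and the gradient with respect to scale is the drawn eps, which is what the two printed values should reflect.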