Probability Density Reweight

Reweighting multiplies each sample by a weight so that samples drawn from an original density $P_0$ can be used to estimate quantities under a new density $P_1$.
When samples $x$ are drawn from the original density, the expectation of $x$ is
$$E_{x \sim P_0}[x] = \int P_0(x)\,x\,dx \approx \frac{1}{N} \sum_i x_i \tag{1}$$
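As a small illustration of Eq. (1), the sketch below estimates an expectation by a plain sample average. The choice of $P_0$ as a Gaussian with mean 2 and standard deviation 1, the sample count, and the helper name `mc_mean` are all assumptions made for this example.

```python
import random


def mc_mean(samples):
    """Monte Carlo estimate of E[x]: the plain sample average, as in Eq. (1)."""
    return sum(samples) / len(samples)


rng = random.Random(0)  # fixed seed so the run is reproducible

# Assumed example density: P_0 is a Gaussian with mean 2 and std 1.
samples = [rng.gauss(2.0, 1.0) for _ in range(100_000)]

estimate = mc_mean(samples)
print(estimate)  # should be close to the true mean, 2
```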

Here $x_i$ are the sampled points and $N$ is the number of samples.
Our goal is to convert $x \sim P_0$ into $x \sim P_1$, that is, to obtain the expectation of $x$ under $P_1$:
$$E_{x \sim P_1}[x] \approx \frac{1}{N} \sum_i w(x_i) \cdot x_i, \qquad w(x_i) = \frac{P_1(x_i)}{P_0(x_i)} \tag{2}$$

Here $w(x_i)$ is the reweighting weight. The proof is as follows.
Multiplying each sample $x_i$ by $w(x_i)$ and applying Eq. (1) gives
$$\frac{1}{N}\sum_i \frac{P_1(x_i)}{P_0(x_i)}x_i \approx \int P_0(x) \frac{P_1(x)}{P_0(x)}\,x\,dx = \int P_1(x)\,x\,dx = E_{x \sim P_1}[x] \tag{3}$$
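The reweighted estimator of Eqs. (2) and (3) can be sketched numerically as follows. The two densities here are assumptions chosen for illustration: $P_0$ is a standard normal and $P_1$ is a normal with mean 1, both normalized. The helper names `normal_pdf` and `reweighted_mean` are hypothetical.

```python
import math
import random


def normal_pdf(x, mean, std):
    """Normalized Gaussian density."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def reweighted_mean(samples, p0, p1):
    """Estimate E_{x~P1}[x] from samples of P0 via (1/N) * sum(w(x_i) * x_i)."""
    total = 0.0
    for x in samples:
        w = p1(x) / p0(x)  # reweighting weight w(x_i) = P1(x_i) / P0(x_i)
        total += w * x
    return total / len(samples)


rng = random.Random(0)  # fixed seed so the run is reproducible

# Samples are drawn from P0 only; P1 is never sampled directly.
samples = [rng.gauss(0.0, 1.0) for _ in range(200_000)]

estimate = reweighted_mean(
    samples,
    p0=lambda x: normal_pdf(x, 0.0, 1.0),  # assumed P0: N(0, 1)
    p1=lambda x: normal_pdf(x, 1.0, 1.0),  # assumed P1: N(1, 1)
)
print(estimate)  # should be close to the mean of P1, which is 1
```

The estimate is only reliable when $P_0$ places enough mass wherever $P_1$ does. If the two densities barely overlap, a few samples carry very large weights and the variance of the estimate grows.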

Note that the densities above are normalized probability densities, that is, they integrate to 1 over their domain. If $P_0$ and $P_1$ are unnormalized, the expectation under $P_0$ must be divided by the normalization constant:
$$E_{x \sim P_0}[x] = \frac{1}{\int P_0(x)\,dx} \int P_0(x)\,x\,dx \approx \frac{1}{N} \sum_i x_i \tag{4}$$

The expectation of $x$ under $P_1$ then becomes
$$E_{x \sim P_1}[x] \approx \frac{\sum_i w(x_i) \cdot x_i}{\sum_i w(x_i)} \tag{5}$$

The proof is as follows. Applying Eq. (4) to the weighted samples gives
$$\frac{1}{N}\sum_i \frac{P_1(x_i)}{P_0(x_i)}x_i \approx \frac{1}{\int P_0(x)\,dx} \int P_1(x)\,x\,dx = \frac{\int P_1(x)\,dx}{\int P_0(x)\,dx} E_{x \sim P_1}[x] \approx \frac{1}{N}\sum_i w(x_i) \cdot E_{x \sim P_1}[x]$$
The last step uses the same approximation with $x_i$ replaced by 1, so that $\frac{1}{N}\sum_i w(x_i) \approx \frac{\int P_1(x)\,dx}{\int P_0(x)\,dx}$. Dividing both sides by $\frac{1}{N}\sum_i w(x_i)$ yields Eq. (5).
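The self-normalized estimator of Eq. (5) can be checked with a short sketch. The unnormalized densities below are assumptions for illustration: `p0_unnorm` is a standard normal without its $1/\sqrt{2\pi}$ factor, and `p1_unnorm` is a normal with mean 1 scaled by an arbitrary factor of 3. The function names are hypothetical.

```python
import math
import random


def p0_unnorm(x):
    # Unnormalized N(0, 1): the 1/sqrt(2*pi) factor is dropped.
    return math.exp(-0.5 * x * x)


def p1_unnorm(x):
    # Unnormalized N(1, 1), multiplied by an arbitrary constant 3.
    return 3.0 * math.exp(-0.5 * (x - 1.0) ** 2)


def self_normalized_mean(samples, p0, p1):
    """Eq. (5): sum(w_i * x_i) / sum(w_i), with w_i = p1(x_i) / p0(x_i)."""
    weights = [p1(x) / p0(x) for x in samples]
    weighted_sum = sum(w * x for w, x in zip(weights, samples))
    return weighted_sum / sum(weights)


rng = random.Random(0)  # fixed seed so the run is reproducible

# Samples follow the normalized version of p0_unnorm, i.e. N(0, 1).
samples = [rng.gauss(0.0, 1.0) for _ in range(200_000)]

# Dividing by N instead of sum(w_i) leaves the ratio of the two
# normalization constants (here 3) in the result.
naive = sum(p1_unnorm(x) / p0_unnorm(x) * x for x in samples) / len(samples)

estimate = self_normalized_mean(samples, p0_unnorm, p1_unnorm)
print(naive)     # should be close to 3, off by the normalizer ratio
print(estimate)  # should be close to the mean of P1, which is 1
```

Because the weights appear in both the numerator and the denominator, any constant factor in $P_0$ or $P_1$ cancels, which is why Eq. (5) works without knowing either normalization constant.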
