EM算法公式推导一

EM算法公式推导两种方式
1 引入因变量Z l o g ( p ( x ∣ θ ) ) = l o g ( ∫ p ( z , x ∣ θ ) d z ) ( 1 ) log(p(x|\theta))=log(\int p(z,x|\theta) dz)\quad(1) log(p(xθ))=log(p(z,xθ)dz)(1)

2 根据贝叶斯公式
l o g ( p ( x ) ) = l o g ( p ( x , z ) ) l o g ( q ( z ) ) − l o g ( p ( z ∣ x ) ) l o g ( q ( z ) ) log(p(x))=\frac{log(p(x,z))}{log(q(z))}-\frac{log(p(z|x))}{log(q(z))} log(p(x))=log(q(z))log(p(x,z))log(q(z))log(p(zx))

方法1 引入因变量Z

l o g ( p ( x ∣ θ ) ) = l o g ( ∫ p ( z , x ∣ θ ) d z ) ( 1 ) log(p(x|\theta))=log(\int p(z,x|\theta) dz)\quad(1) log(p(xθ))=log(p(z,xθ)dz)(1)
l o g ( p ( x ) ) = l o g ( ∫ q ( z ) p ( z , x ∣ θ ) q ( z ) ) ( 2 ) log(p(x))=log(\int q(z)\frac{p(z,x|\theta)}{q(z)}) \quad(2) log(p(x))=log(q(z)q(z)p(z,xθ))(2)
由于log函数是凸函数
l o g ( p ( x ) ) > = ∫ q ( z ) l o g ( p ( z , x ∣ θ ) q ( z ) ) ( 3 ) log(p(x))>=\int q(z)log(\frac{p(z,x|\theta)}{q(z)})\quad(3) log(p(x))>=q(z)log(q(z)p(z,xθ))(3)
取等号时, p ( z , x ∣ θ ) q ( z ) = c ( 4 ) \frac{p(z,x|\theta)}{q(z)}=c \quad (4) q(z)p(z,xθ)=c(4)是常数
p ( z , x ∣ θ ) = c ∗ q ( z ) ( 5 ) p(z,x|\theta)=c*q(z)\quad(5) p(z,xθ)=cq(z)(5)
两边同时对z积分
∫ z p ( z , x ∣ θ ) d z = ∫ z c ∗ q ( z ) d z ( 6 ) \int _z p(z,x|\theta) dz=\int _z c* q(z) dz\quad(6) zp(z,xθ)dz=zcq(z)dz(6)
可得 p ( x ∣ θ ) = c p(x|\theta)=c p(xθ)=c
根据公式4
p ( z , x ∣ θ ) q ( z ) = p ( x ∣ θ ) q ( z ) = p ( z ∣ x , θ ) \frac{p(z,x|\theta)}{q(z)}=p(x|\theta)\\ q(z)=p(z|x,\theta) q(z)p(z,xθ)=p(xθ)q(z)=p(zx,θ)
q(z)=p(z|x,theta)带入公式3
l o g ( p ( x ) ) > = ∫ q ( z ) l o g ( p ( z , x ∣ θ ) q ( z ) ) l o g ( p ( x ) ) = ∫ p ( z ∣ x , θ t ) l o g ( p ( z , x ∣ θ t + 1 ) p ( z ∣ x , θ t ) ) log(p(x))>=\int q(z)log(\frac{p(z,x|\theta)}{q(z)})\quad \\ log(p(x))=\int p(z|x,\theta^t)log(\frac{p(z,x|\theta^{t+1})}{p(z|x,\theta^t)}) log(p(x))>=q(z)log(q(z)p(z,xθ))log(p(x))=p(zx,θt)log(p(zx,θt)p(z,xθt+1))

方法2 根据公式log(p(x))=log(p(x,z))-log(p(z|x))

由于 p ( x ) = p ( x , z ) p ( z ∣ x ) ( 1 ) 由于 p(x)=\frac{p(x,z)}{p(z|x )} \quad(1) 由于p(x)=p(zx)p(x,z)(1)
l o g ( p ( x ) ) = l o g ( p ( x , z ) ) − l o g ( p ( z ∣ x ) ) ( 2 ) log(p(x))=log(p(x,z))-log(p(z|x)) \quad(2) log(p(x))=log(p(x,z))log(p(zx))(2)
l o g ( p ( x ) ) = l o g ( p ( x , z ) ) l o g ( q ( z ) ) − l o g ( p ( z ∣ x ) ) l o g ( q ( z ) ) ( 3 ) log(p(x))=\frac{log(p(x,z))}{log(q(z))}-\frac{log(p(z|x))}{log(q(z))} \quad(3) log(p(x))=log(q(z))log(p(x,z))log(q(z))log(p(zx))(3)
两边同时对q(z)积分
左边 = ∫ q ( z ) l o g ( p ( x ) )   d z ( 4 ) 左边=\int {q(z)log(p(x)) }\,{\rm d}z\quad(4) 左边=q(z)log(p(x))dz(4)
由于积分与x无关
左边 = l o g ( p ( x ) ) ( 5 ) 左边= log(p(x)) \quad(5) 左边=log(p(x))(5)
右边 = ∫ q ( z ) ( l o g ( p ( x , z ) ) l o g ( q ( z ) ) − l o g ( p ( z ∣ x ) ) l o g ( q ( z ) ) )   d z ( 6 ) 右边=\int {q(z)(\frac{log(p(x,z))}{log(q(z))}-\frac{log(p(z|x))}{log(q(z))} )}\,{\rm d}z \quad(6) 右边=q(z)(log(q(z))log(p(x,z))log(q(z))log(p(zx)))dz(6)
右边可以变成两项
∫ q ( z ) ( l o g ( p ( x , z ) ) l o g ( q ( z ) )   d z − ∫ q ( z ) l o g ( p ( z ∣ x ) ) l o g ( q ( z ) )   d z ( 7 ) \int {q(z)(\frac{log(p(x,z))}{log(q(z))}\,{\rm d}z -\int q(z)\frac{log(p(z|x))}{log(q(z))} }\,{\rm d}z \quad(7) q(z)(log(q(z))log(p(x,z))dzq(z)log(q(z))log(p(zx))dz(7)
第二项就是 K L 散度 ∫ q ( z ) l o g ( p ( z ∣ x ) ) l o g ( q ( z ) )   d z ( 8 ) 第二项 就是KL散度 \int {q(z)\frac{log(p(z|x))}{log(q(z))} }\,{\rm d}z \quad(8) 第二项就是KL散度q(z)log(q(z))log(p(zx))dz(8)
第一项 ∫ q ( z ) ( l o g ( p ( x , z ) ) l o g ( q ( z ) )   d z 为 E L B O ( 9 ) 第一项 \int q(z)(\frac{log(p(x,z))}{log(q(z))}\,{\rm d}z 为ELBO\quad(9) 第一项q(z)(log(q(z))log(p(x,z))dzELBO(9)
当 k l 距离为 0 时, p ( x ) 概率最大 , 即 l o g ( p ( z ∣ x ) ) l o g ( q ( z ) ) 为常数 当kl距离为0时,p(x)概率最大,即\frac{log(p(z|x))}{log(q(z))} 为常数 kl距离为0时,p(x)概率最大,log(q(z))log(p(zx))为常数
此时 q ( z ) = p ( z ∣ x ) 此时 q(z)=p(z|x) 此时q(z)=p(zx)
将 q ( z ) = p ( z ∣ x ) 带入第一项 ∫ p ( z ∣ x ) ( l o g ( p ( x , z ) ) l o g ( p ( z ∣ x ) )   d z 将q(z)=p(z|x)带入第一项 \int p(z|x)(\frac{log(p(x,z))}{log(p(z|x))}\,{\rm d}z q(z)=p(zx)带入第一项p(zx)(log(p(zx))log(p(x,z))dz
EM 中都有\theta作为条件,在求最大化\theta时,log(p(z|x)与\theta无关可以省略
a r g m a x θ ∫ p ( z ∣ x ) ( l o g ( p ( x , z ) )   d z ( 10 ) argmax_{\theta} \int p(z|x)(log(p(x,z))\,{\rm d}z\quad(10) argmaxθp(zx)(log(p(x,z))dz(10)

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值