Variational Inference Review

变分推断是一种通过找到密度族中最接近目标密度的分布来近似复杂密度的方法。相对于MCMC,它更易于计算且适合大规模数据。本文主要介绍了变分推断的问题、变分目标函数、平均场变分家族以及坐标上升变分推断(CAVI),并通过贝叶斯高斯混合模型举例说明。
摘要由CSDN通过智能技术生成

Variational Inference Review

Idea: posit a family of density and find one from the family that is the closest (in K-L divergence) to the target density.
For statisticians: VI provides a method to approximate complicate densities. Compared with MCMC, it’s easier to compute and scale to big data.

Problem of Approximate Inference

x x x-observations
z z z-latent variables
want to estimate
p ( z ∣ x ) = p ( z , x ) p ( x ) p(z|x)=\frac{p(z,x)}{p(x)} p(zx)=p(x)p(z,x)

pick a family of distribution Q \mathcal{Q} Q, and approximate p ( z ∣ x ) p(z|x) p(zx) by
q ∗ ( z ) = arg min ⁡ q ( z ) ∈ Q   K L ( q ( z ) ∣ ∣ p ( z ∣ x ) ) q^*(z)=\argmin_{q(z) \in \mathcal{Q}}\ KL(q(z)||p(z|x)) q(z)=q(z)Qargmin KL(q(z)p(zx))

Variational Object Function

K L ( q ( z ) ∣ ∣ p ( z ∣ x ) ) = E q [ log ⁡ q ( z ) ] − E q [ log ⁡ p ( z ∣ x ) ] = E q [ log ⁡ q ( z ) ] − E q [ log ⁡ p ( z , x ) ] + log ⁡ p ( x ) = − E L B O ( q ) + c o n s t . KL(q(z)||p(z|x))=E_q[\log q(z)]-E_q[\log p(z|x)] \\ = E_q[\log q(z)]-E_q[\log p(z,x)]+\log p(x) \\ = -ELBO(q)+const. KL(q(z)p(zx))=Eq[logq(z)]Eq[logp(zx)]=Eq[logq(z)]Eq[logp(z,x)]+logp(x)=ELBO(q)+const.

ELBO = evidence lower bound, and minimizing K-L is equivalent to maximizing ELBO:
E L B O ( q ) = E q [ log ⁡ p ( z , x ) ] − E q [ log ⁡ q ( z ) ] = E q [ log ⁡ p ( z ) ] + E q [ log ⁡ p ( x ∣ z ) ] − E q [ log ⁡ q ( z ) ] = E [ log ⁡ p ( x ∣ z ) ] ⏟ e x p e c t e d   l i k e h o o d − K L ( q ( z ) ∣ ∣ p ( z ) ) ⏟ p e n a l i z e   d e v i a t i o n   f r o m   p r i o r ELBO(q)=E_q[\log p(z,x)]-E_q[\log q(z)] \\ = E_q[\log p(z)]+E_q[\log p(x|z)]-E_q[\log q(z)] \\ = \underbrace{E[\log p(x|z)]}_{expected\ likehood}-\underbrace{KL(q(z)||p(z))}_{penalize\ deviation\ from\ prior} ELBO(q)=Eq[logp(z,x)]Eq[logq(z)]=Eq[logp(z)]+Eq[logp(xz)]Eq[logq(z)]=expected likehood E[logp(xz)]

评论 1
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值