The EM Algorithm

prupcognition

Article link: https://blog.csdn.net/m0_37896765/article/details/94558922

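The EM (expectation-maximization) algorithm is a method for estimating model parameters when the model contains latent (hidden) variables.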
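The mathematical expectation of a function $g(x)$ can be written as follows (shown for a discrete $x$ with probability mass function $p(x)$; for a continuous $x$ the sum becomes an integral):
$E[g(x)]=\displaystyle \sum_x g(x)p(x)$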
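As a small illustration of the expectation of a function, the sketch below computes the probability-weighted sum for a discrete distribution. The helper name `expectation` and the fair-die example are assumptions made for illustration; they are not part of the original post.

```python
def expectation(g, values, probs):
    """Return E[g(x)] = sum over x of g(x) * p(x) for a discrete distribution."""
    assert abs(sum(probs) - 1.0) < 1e-12, "probabilities must sum to 1"
    return sum(g(x) * p for x, p in zip(values, probs))


# Hypothetical example: a fair six-sided die and g(x) = x**2.
values = [1, 2, 3, 4, 5, 6]
probs = [1 / 6] * 6
e_g = expectation(lambda x: x ** 2, values, probs)
print(e_g)
```

For the die, the exact value is 91/6, since the squares 1, 4, 9, 16, 25, 36 sum to 91 and each has weight 1/6.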
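The idea of the EM algorithm:
1: Given an initial $\theta$, compute the expectation $E f(x)$ of the function $f(x)$ (the E-step; in EM, $f$ is the log of the joint likelihood and the expectation is taken over the latent variable);
2: Then find the new value of $\theta$ that maximizes this expectation (the M-step).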

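Derivation:

Build the log-likelihood objective
$L(\theta)=\displaystyle \sum_{i=1}^{n}\log p(x_i|\theta)$
Here $x_1,\dots,x_n$ are the samples, each drawn from some distribution $p(x|\theta)$, and $\theta$ denotes the parameters of that distribution.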
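When the objective cannot be maximized directly because it depends on a latent variable, we introduce the latent variable $z$ and the objective becomes:
$L(\theta)=\displaystyle \sum_{i=1}^{n} \log \sum_z p(x_i,z|\theta)$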
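To make the latent-variable objective concrete, here is a minimal sketch that evaluates the sum over samples of the log of the sum over $z$ of the joint probability. It assumes a one-dimensional Gaussian mixture, where $z$ is the component index and the joint is the component weight times the component density. The mixture model, the function names, and the sample values are all illustrative assumptions, not something the original post specifies.

```python
import math


def gaussian_pdf(x, mean, std):
    """Density of a 1-D normal distribution N(mean, std**2) at x."""
    return math.exp(-0.5 * ((x - mean) / std) ** 2) / (std * math.sqrt(2 * math.pi))


def log_likelihood(xs, weights, means, stds):
    """L(theta) = sum_i log sum_z p(x_i, z | theta), with p(x, z) = w_z * N(x | mu_z, sigma_z)."""
    total = 0.0
    for x in xs:
        # Joint probability of the sample with each value of the latent z.
        joint = [w * gaussian_pdf(x, m, s) for w, m, s in zip(weights, means, stds)]
        # Marginalize z out inside the log.
        total += math.log(sum(joint))
    return total


# Hypothetical data and parameters, chosen only for illustration.
xs = [-2.1, -1.9, 0.1, 1.8, 2.2]
ll = log_likelihood(xs, [0.5, 0.5], [-2.0, 2.0], [1.0, 1.0])
print(ll)
```

The sum over $z$ sits inside the logarithm, which is what prevents a closed-form maximization and motivates the lower bound built with Jensen's inequality.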
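Multiply and divide each term by a distribution $Q_i(z)$ over $z$ (one per sample, with $\sum_z Q_i(z)=1$) to set up Jensen's inequality:
$L(\theta)=\displaystyle \sum_{i=1}^{n} \log \sum_z Q_i(z) \frac{p(x_i,z| \theta)}{Q_i(z)}$
Apply Jensen's inequality: because $\log$ is a concave function, $f(E[y])\geq E[f(y)]$, where here $f=\log$, $y=\frac{p(x_i,z|\theta)}{Q_i(z)}$, and the expectation is taken over $z\sim Q_i$:
$L(\theta)=\displaystyle \sum_{i=1}^{n} \log \sum_z Q_i(z) \frac{p(x_i,z| \theta)}{Q_i(z)} \geq \displaystyle \sum_{i=1}^{n} \sum_z Q_i(z)\log \frac{p(x_i,z| \theta)}{Q_i(z)}$
In other words, $E[f(y)]$ is a lower bound on $f(E[y])$, so the right-hand side is a lower bound on $L(\theta)$.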
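Jensen's inequality for the concave logarithm can be checked numerically. The sketch below compares the log of a weighted mean with the weighted mean of the logs; the weights `q` and the values in `ratios` are made-up numbers that stand in for the distribution over $z$ and the ratio of the joint probability to that distribution.

```python
import math


def jensen_sides(q, ratios):
    """Return (log of the Q-weighted mean, Q-weighted mean of the logs)."""
    lhs = math.log(sum(qz * r for qz, r in zip(q, ratios)))
    rhs = sum(qz * math.log(r) for qz, r in zip(q, ratios))
    return lhs, rhs


# Hypothetical weights (a distribution over three values of z) and ratios.
q = [0.2, 0.5, 0.3]
ratios = [0.4, 1.5, 3.0]
lhs, rhs = jensen_sides(q, ratios)
print(lhs, rhs)  # the first value is at least as large as the second

# When the ratio is the same for every z, the two sides coincide.
lhs_const, rhs_const = jensen_sides(q, [2.0, 2.0, 2.0])
print(lhs_const, rhs_const)
```

The second call hints at when the bound is tight: the two sides agree exactly when the quantity inside the logarithm does not vary with $z$.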
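Equality holds only if the ratio does not depend on $z$, that is $\displaystyle \frac{p(x_i,z| \theta)}{Q_i(z)}=C$ for a constant $C$, which means $Q_i(z)=\displaystyle\frac{1}{C}p(x_i,z| \theta)$. Integrating both sides over $z$ (summing, when $z$ is discrete) gives
$\int_z Q_i(z)dz=\int_z \displaystyle\frac{1}{C}p(x_i,z| \theta)dz$ ,
Since $Q_i$ is a distribution, the left-hand side equals 1, so:
$1=\displaystyle\frac{1}{C}\int_z p(x_i,z| \theta)dz=\displaystyle\frac{1}{C}p(x_i| \theta)$
and therefore $C=p(x_i|\theta)$.
Substituting back into the equality condition:
$\displaystyle \frac{p(x_i,z| \theta)}{Q_i(z)}=C$
$\displaystyle \frac{p(x_i,z| \theta)}{Q_i(z)}=p(x_i| \theta)$
$\displaystyle \frac{p(x_i,z| \theta)}{p(x_i| \theta)}=Q_i(z)$
$\displaystyle p(z |x_i,\theta)=Q_i(z)$
So $Q_i(z)$ equals the posterior distribution of $z$ given $x_i$ and $\theta$.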
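The tightness of the bound at the posterior can be verified on a toy case. The sketch below fixes one sample and a latent variable with two values, uses made-up joint probabilities, and compares the lower bound under the posterior with the lower bound under a different distribution. All numbers and names are illustrative assumptions.

```python
import math


def lower_bound(joint, q):
    """sum_z Q(z) * log(p(x, z) / Q(z)) for one sample; terms with Q(z) = 0 contribute 0."""
    return sum(qz * math.log(pz / qz) for pz, qz in zip(joint, q) if qz > 0)


# Hypothetical joint probabilities p(x, z | theta) for one fixed x and z in {0, 1}.
joint = [0.1, 0.3]
evidence = sum(joint)                        # p(x | theta)
posterior = [pz / evidence for pz in joint]  # p(z | x, theta)

tight = lower_bound(joint, posterior)        # bound with Q = posterior
loose = lower_bound(joint, [0.5, 0.5])       # bound with some other Q

print(tight, math.log(evidence), loose)
```

With the posterior, every ratio inside the logarithm equals the evidence, so the bound collapses to the log-likelihood of the sample; any other choice of distribution gives a strictly smaller value here.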

Steps of the EM algorithm:

Repeat until convergence:
$\{$
E-step: $Q_i(z) =p(z |x_i,\theta^t)$
M-step: $\theta^{t+1} =\arg\max_{\theta}\displaystyle \sum_{i=1}^{n} \sum_z Q_i(z)\log \frac{p(x_i,z| \theta)}{Q_i(z)}$
$\theta^{t+1} =\arg\max_{\theta}\displaystyle \sum_{i=1}^{n} \sum_z Q_i(z)(\log p(x_i,z| \theta)-\log Q_i(z))$
Because $Q_i(z)$ was fixed in the E-step using $\theta^t$ and does not depend on the $\theta$ being optimized, the $\log Q_i(z)$ term can be dropped, and the expression becomes:
$\theta^{t+1} =\arg\max_{\theta}\displaystyle \sum_{i=1}^{n} \sum_z Q_i(z)\log p(x_i,z| \theta)$
$\theta^{t+1} =\arg\max_{\theta}\displaystyle \sum_{i=1}^{n} \sum_z p(z |x_i,\theta^t)\log p(x_i,z| \theta)$
$\}$
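The loop above can be sketched end to end for one concrete model. The example assumes a one-dimensional mixture of two Gaussians, for which the M-step maximization has a well-known closed form (responsibility-weighted means, variances, and mixing weights). The model choice, all function names, the synthetic data, the `min_std` guard, and the stopping tolerance are illustrative assumptions rather than anything prescribed by the original post.

```python
import math
import random


def gaussian_pdf(x, mean, std):
    """Density of a 1-D normal distribution N(mean, std**2) at x."""
    return math.exp(-0.5 * ((x - mean) / std) ** 2) / (std * math.sqrt(2 * math.pi))


def e_step(xs, weights, means, stds):
    """E-step: Q_i(z) = p(z | x_i, theta^t) for every sample."""
    resp = []
    for x in xs:
        joint = [w * gaussian_pdf(x, m, s) for w, m, s in zip(weights, means, stds)]
        total = sum(joint)
        resp.append([j / total for j in joint])
    return resp


def m_step(xs, resp, min_std=1e-3):
    """M-step: closed-form argmax of sum_i sum_z Q_i(z) log p(x_i, z | theta)."""
    n, k = len(xs), len(resp[0])
    weights, means, stds = [], [], []
    for z in range(k):
        nz = sum(r[z] for r in resp)  # effective number of samples in component z
        mean = sum(r[z] * x for r, x in zip(resp, xs)) / nz
        var = sum(r[z] * (x - mean) ** 2 for r, x in zip(resp, xs)) / nz
        weights.append(nz / n)
        means.append(mean)
        # min_std is a hypothetical guard against a collapsing component.
        stds.append(max(math.sqrt(var), min_std))
    return weights, means, stds


def log_likelihood(xs, weights, means, stds):
    """L(theta) = sum_i log sum_z p(x_i, z | theta)."""
    return sum(
        math.log(sum(w * gaussian_pdf(x, m, s) for w, m, s in zip(weights, means, stds)))
        for x in xs
    )


def em(xs, weights, means, stds, tol=1e-8, max_iter=200):
    """Alternate E- and M-steps until the log-likelihood stops improving."""
    history = [log_likelihood(xs, weights, means, stds)]
    for _ in range(max_iter):
        resp = e_step(xs, weights, means, stds)
        weights, means, stds = m_step(xs, resp)
        history.append(log_likelihood(xs, weights, means, stds))
        if history[-1] - history[-2] < tol:
            break
    return weights, means, stds, history


# Synthetic data: two clusters with true means -2 and 3 (assumed for the demo).
random.seed(0)
xs = [random.gauss(-2.0, 1.0) for _ in range(200)]
xs += [random.gauss(3.0, 1.0) for _ in range(200)]

weights, means, stds, history = em(xs, [0.5, 0.5], [-1.0, 1.0], [1.0, 1.0])
print(weights, means, stds)
```

The `history` list records the log-likelihood after each iteration. Because each E-step makes the lower bound tight and each M-step maximizes it, these values should never decrease, which is a convenient sanity check when implementing EM.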
