EM is typically used to compute maximum likelihood estimates given incomplete samples. The EM algorithm estimates the parameters of a model iteratively.
Starting from some initial guess, each iteration consists of
- an E step (Expectation step)
- an M step (Maximization step)
EM Applications
- Filling in missing data in samples
- Discovering the value of latent variables
- Estimating the parameters of HMMs
- Estimating parameters of finite mixtures
- Unsupervised learning of clusters
Silly Example
Let events be “grades in a class”:
event | likelihood |
---|---|
w1 = Gets an A | P(A) = 1/2 |
w2 = Gets a B | P(B) = μ |
w3 = Gets a C | P(C) = 2μ |
w4 = Gets a D | P(D) = 1/2 − 3μ |
(Note 0 ≤ μ ≤ 1/6.)
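The table above defines a valid distribution for any μ in the allowed range; a minimal sketch (the helper name `grade_probs` is ours, not from the notes):

```python
def grade_probs(mu):
    # Grade distribution from the table; the constraint 0 <= mu <= 1/6
    # keeps every probability non-negative.
    assert 0 <= mu <= 1/6, "the table requires 0 <= mu <= 1/6"
    return {"A": 1/2, "B": mu, "C": 2 * mu, "D": 1/2 - 3 * mu}
```

For any legal μ the four probabilities sum to 1, which is what makes the table a proper model of the class.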
Assume we want to estimate µ from data. In a given class there were
a A’s
b B’s
c C’s
d D’s
What is the maximum likelihood estimate of µ given a,b,c,d ?
Trivial Statistics
P(a, b, c, d | μ) = (1/2)^a (μ)^b (2μ)^c (1/2 − 3μ)^d
log P(a, b, c, d | μ) = a log(1/2) + b log μ + c log(2μ) + d log(1/2 − 3μ)
For the maximum-likelihood μ, set ∂logP/∂μ = 0:
∂logP/∂μ = b/μ + c/μ − 3d/(1/2 − 3μ) = 0
which gives the maximum-likelihood estimate μ = (b + c) / (6(b + c + d)).
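The closed-form estimate can be sketched directly (the function name `mle_mu` is ours, not from the notes):

```python
def mle_mu(a, b, c, d):
    # Setting d(log P)/d(mu) = b/mu + c/mu - 3d/(1/2 - 3mu) to zero
    # and solving yields mu = (b + c) / (6 (b + c + d)).
    return (b + c) / (6 * (b + c + d))
```

Note that the count of A's drops out: the a log(1/2) term is constant in μ, so only b, c, d inform the estimate.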
Same Problem with Hidden Information
Someone tells us that
Number of High grades (A’s + B’s) = h
Number of C’s = c
Number of D’s = d
What is the max. like estimate of µ now?
We can answer this question circularly:
EXPECTATION
If we know the value of µ we could compute the expected value of a and b
a : b = 1/2 : μ, so
a = (1/2)/(1/2 + μ) · h  and  b = μ/(1/2 + μ) · h
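This E step can be sketched as follows (the function name `expected_a_b` is ours, not from the notes): the h observed high grades are split into expected counts of A's and B's in the ratio P(A) : P(B) = 1/2 : μ.

```python
def expected_a_b(mu, h):
    # Expected counts given mu: distribute h in proportion 1/2 : mu.
    a = (1/2) / (1/2 + mu) * h
    b = mu / (1/2 + mu) * h
    return a, b
```

By construction the two expected counts sum back to h, and a/b equals (1/2)/μ as required.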