Gaussian mixture model (GMM)
A Gaussian mixture model is a probabilistic model that assumes all the data points are generated from a mixture of a finite number of Gaussian distributions with unknown parameters.
Interpretation from geometry
$p(x)$ is a weighted sum of multiple Gaussian distributions:

$$p(x)=\sum_{k=1}^{K} \alpha_{k} \cdot \mathcal{N}\left(x \mid \mu_{k}, \Sigma_{k}\right)$$
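This weighted-sum view can be sketched directly in NumPy. The function names `gaussian_pdf` and `gmm_density` below are illustrative choices, not from the original text; this is a minimal sketch, not an optimized implementation:

```python
import numpy as np

def gaussian_pdf(x, mean, cov):
    """Density of N(x | mean, cov) for a d-dimensional point x."""
    d = mean.shape[0]
    diff = x - mean
    exponent = -0.5 * diff @ np.linalg.inv(cov) @ diff
    norm = np.sqrt((2 * np.pi) ** d * np.linalg.det(cov))
    return np.exp(exponent) / norm

def gmm_density(x, weights, means, covs):
    """p(x) = sum_k alpha_k * N(x | mu_k, Sigma_k)."""
    return sum(w * gaussian_pdf(x, m, c)
               for w, m, c in zip(weights, means, covs))
```

Since the weights $\alpha_k$ sum to 1, the result is itself a valid probability density.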
Interpretation from mixture model
setup:

- The total number of Gaussian distributions, $K$.
- $x$, a sample (observed variable).
- $z$, the distribution from which the sample $x$ is drawn (a latent variable), where
  - $z \in \{c_1, c_2, \dots, c_K\}$,
  - $\sum_{k=1}^K p(z=c_k)= 1$. We denote $p(z=c_k)$ by $p_k$.
Mixture models are usually generative models, which means new data can be drawn from the model's distribution. Specifically, in the Gaussian Mixture Model (GMM), a new data point is generated by first selecting a class $c_k$ according to the probability distribution over the classes, and then drawing a value from the Gaussian distribution of that class. Therefore, we can write $p(x)$ as follows:
$$\begin{aligned} p(x) &= \sum_z p(x,z) \\ &= \sum_{k=1}^{K} p(x, z=c_k) \\ &= \sum_{k=1}^{K} p(z=c_k) \cdot p(x \mid z=c_k) \\ &= \sum_{k=1}^{K} p_k \cdot \mathcal{N}(x \mid \mu_{k}, \Sigma_{k}) \end{aligned}$$
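The two-step generative process just described (first sample a class $z$, then sample $x$ from that class's Gaussian) can be sketched as follows. This is an illustrative NumPy snippet with assumed names (`sample_gmm`, `seed`), not code from the original article:

```python
import numpy as np

def sample_gmm(n, weights, means, covs, seed=0):
    """Draw n points: z_i ~ Categorical(p_1..p_K), then x_i ~ N(mu_{z_i}, Sigma_{z_i})."""
    rng = np.random.default_rng(seed)
    # step 1: select a class c_k according to the class probabilities
    z = rng.choice(len(weights), size=n, p=weights)
    # step 2: draw a value from that class's Gaussian distribution
    x = np.stack([rng.multivariate_normal(means[k], covs[k]) for k in z])
    return x, z
```

Marginalizing out the sampled labels `z` recovers exactly the mixture density $p(x)$ derived above.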
We see that the two interpretations lead to the same result.
GMM Derivation
setup:
- $X$: observed data, where $X = (x_1, x_2, \dots, x_N)$.
- $\theta$: the parameters of the model, where $\theta=\left\{p_{1}, p_{2}, \cdots, p_{K}, \mu_{1}, \mu_{2}, \cdots, \mu_{K}, \Sigma_{1}, \Sigma_{2}, \cdots, \Sigma_{K}\right\}$.
- $p(x) = \sum_{k=1}^{K} p_k \cdot \mathcal{N}(x \mid \mu_{k}, \Sigma_{k})$.
- $p(x,z) = p(z) \cdot p(x \mid z) = p_z \cdot \mathcal{N}(x \mid \mu_{z}, \Sigma_{z})$.
- $p(z \mid x) = \dfrac{p(x,z)}{p(x)} = \dfrac{p_z \cdot \mathcal{N}(x \mid \mu_{z}, \Sigma_{z})}{\sum_{k=1}^K p_k \cdot \mathcal{N}(x \mid \mu_{k}, \Sigma_{k})}$, where the denominator sums over all $K$ components.
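The posterior $p(z \mid x)$ above (the "responsibility" each component takes for a point) can be computed directly from the last formula. Below is a 1-D sketch; the helper names `normal_pdf` and `responsibilities` are illustrative, not from the original:

```python
import numpy as np

def normal_pdf(x, mu, var):
    """1-D Gaussian density N(x | mu, var)."""
    return np.exp(-0.5 * (x - mu) ** 2 / var) / np.sqrt(2 * np.pi * var)

def responsibilities(x, p, mu, var):
    """p(z=c_k | x) = p_k N(x|mu_k) / sum_j p_j N(x|mu_j), for all k."""
    joint = np.array([pk * normal_pdf(x, mk, vk)
                      for pk, mk, vk in zip(p, mu, var)])
    return joint / joint.sum()  # normalize by p(x)
```

By construction the responsibilities are nonnegative and sum to 1 over the components.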
Solve by MLE
Assuming the samples in $X$ are i.i.d., we have

$$\begin{aligned} \hat{\theta}_{MLE} &= \underset{\theta}{\operatorname{argmax}} \, p(X) \\ &=\underset{\theta}{\operatorname{argmax}} \log p(X) \\ &=\underset{\theta}{\operatorname{argmax}} \sum_{i=1}^{N} \log p\left(x_{i}\right) \\ &=\underset{\theta}{\operatorname{argmax}} \sum_{i=1}^{N} \log \left[\sum_{k=1}^{K} p_{k} \cdot \mathcal{N}\left(x_{i} \mid \mu_{k}, \Sigma_{k}\right)\right] \end{aligned}$$
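The objective in the last line is easy to evaluate numerically, even though the log of a sum admits no closed-form maximizer (which is why EM is typically used for GMMs). A minimal 1-D sketch, with illustrative names not taken from the original:

```python
import numpy as np

def normal_pdf(x, mu, var):
    """1-D Gaussian density N(x | mu, var)."""
    return np.exp(-0.5 * (x - mu) ** 2 / var) / np.sqrt(2 * np.pi * var)

def gmm_log_likelihood(X, p, mu, var):
    """sum_i log[ sum_k p_k * N(x_i | mu_k, var_k) ]."""
    return sum(np.log(sum(pk * normal_pdf(x, mk, vk)
                          for pk, mk, vk in zip(p, mu, var)))
               for x in X)
```

In practice one would compute the inner sum in log space (log-sum-exp) for numerical stability; the plain form above mirrors the formula as written.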