Overview
A Gaussian mixture model describes the data as a combination of several Gaussian distributions, one component per cluster.
The parameters of each Gaussian are estimated iteratively with the EM algorithm.
There are plenty of theoretical treatments of EM online, but it is worth walking through the algorithm below first and reading the formal derivation afterwards.
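Concretely, the mixture models the density of a sample $x$ as a weighted sum of $k$ Gaussian components:

$$
p(x) = \sum_{i=1}^{k} \alpha_i \, \mathcal{N}(x \mid \mu_i, \Sigma_i), \qquad \sum_{i=1}^{k} \alpha_i = 1
$$

EM alternates between estimating each sample's posterior component memberships (E-step) and re-fitting $\alpha_i, \mu_i, \Sigma_i$ from those memberships (M-step).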
Update rules
- $\mu_i' = \dfrac{\sum_{j=1}^m \eta_{ji} \, x_j}{\sum_{j=1}^m \eta_{ji}}$
- $\Sigma_i' = \dfrac{\sum_{j=1}^m \eta_{ji} \, (x_j - \mu_i')(x_j - \mu_i')^T}{\sum_{j=1}^m \eta_{ji}}$
- $\alpha_i' = \dfrac{\sum_{j=1}^m \eta_{ji}}{m}$
- $m$ is the number of samples
- $\eta_{ji} = P(z_j = i \mid x_j)$ is the posterior probability that sample $x_j$ was generated by the $i$-th Gaussian component
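The responsibility $\eta_{ji}$ used in the updates above follows from Bayes' rule applied to the mixture:

$$
\eta_{ji} = P(z_j = i \mid x_j) = \frac{\alpha_i \, \mathcal{N}(x_j \mid \mu_i, \Sigma_i)}{\sum_{l=1}^{k} \alpha_l \, \mathcal{N}(x_j \mid \mu_l, \Sigma_l)}
$$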
Iterating these updates to convergence, the final $\eta_{ji}$ values give the clustering result: each sample is assigned to the Gaussian component under which its posterior probability is highest.
Python implementation
- Load the data
```python
from sklearn import datasets
import numpy as np
import matplotlib.pyplot as plt

iris = datasets.load_iris()
```
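A quick sanity check on what was loaded (the iris dataset has 150 samples with 4 features each):

```python
from sklearn import datasets

iris = datasets.load_iris()
print(iris.data.shape)    # (150, 4)
print(iris.target.shape)  # (150,)
```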
- Algorithm implementation
```python
def GMMs(X, k=3, steps=10):
    def p(x, mu, sigma):
        # multivariate Gaussian density N(x | mu, sigma)
        n = len(x)
        div = (2 * np.pi) ** (n / 2) * (abs(np.linalg.det(sigma)) ** 0.5)
        expOn = -0.5 * np.dot((x - mu).T, np.dot(np.linalg.inv(sigma), (x - mu)))
        return np.exp(expOn) / div

    def init(X):
        m, n = X.shape
        # initialize means from random samples, covariances as identity
        # matrices (guaranteed positive definite), and uniform weights
        mus = X[np.random.choice(m, k, replace=False)].astype(float)
        sigmas = np.array([np.eye(n) for _ in range(k)])
        alphas = np.ones(k) / k
        return mus, sigmas, alphas

    # k Gaussian distributions
    mus, sigmas, alphas = init(X)
    # EM algorithm
    mat = np.zeros((len(X), k))
    for times in range(steps):
        # E-step: posterior probability of each component for each sample
        for j, x in enumerate(X):
            temp = 0
            for i in range(k):
                tempP = alphas[i] * p(x, mus[i], sigmas[i])
                temp += tempP
                mat[j][i] = tempP
            mat[j] /= temp
        # M-step
        for i in range(k):
            # update mus
            mus[i] = np.dot(mat[:, i].T, X) / sum(mat[:, i])
            # update sigmas
            temp = np.zeros(sigmas[0].shape)
            for j in range(len(X)):
                data = (X[j] - mus[i]).reshape(-1, 1)
                temp += mat[j][i] * np.dot(data, data.T)
            temp /= sum(mat[:, i])
            sigmas[i] = temp
            # update alphas
            alphas[i] = sum(mat[:, i]) / len(X)
    # clustering: assign each sample to its most probable component
    Ans = np.zeros(len(X))
    for j, x in enumerate(X):
        probs = np.array([alphas[i] * p(x, mus[i], sigmas[i]) for i in range(k)])
        Ans[j] = np.argmax(probs)
    return Ans

test_y = GMMs(iris.data, steps=20)
```
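For comparison, scikit-learn ships the same algorithm as `sklearn.mixture.GaussianMixture`; a minimal sketch of the equivalent call (the hyperparameters here are illustrative, not tuned):

```python
from sklearn import datasets
from sklearn.mixture import GaussianMixture

iris = datasets.load_iris()
# fit a 3-component GMM with EM, then assign each sample to its most
# probable component, mirroring the argmax step in GMMs above
gm = GaussianMixture(n_components=3, max_iter=20, random_state=0)
labels = gm.fit_predict(iris.data)
print(labels.shape)  # (150,)
```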
- Plotting
```python
from sklearn.decomposition import PCA

X_reduced = PCA(n_components=2).fit_transform(iris.data)
plt.scatter(X_reduced[:, 0], X_reduced[:, 1], c=test_y, cmap=plt.cm.Set1)
```
- Evaluation
```python
def evaluate(y, t):
    # count sample pairs by agreement of predicted (y) and true (t) labels
    a, b, c, d = 0, 0, 0, 0
    for i in range(len(y)):
        for j in range(i + 1, len(y)):
            if y[i] == y[j] and t[i] == t[j]:
                a += 1
            elif y[i] == y[j] and t[i] != t[j]:
                b += 1
            elif y[i] != y[j] and t[i] == t[j]:
                c += 1
            else:
                d += 1
    return a, b, c, d

def external_index(a, b, c, d, m):
    JC = a / (a + b + c)
    FMI = np.sqrt(a**2 / ((a + b) * (a + c)))
    # the Rand index divides by the number of pairs C(m, 2) = m*(m-1)/2
    RI = 2 * (a + d) / (m * (m - 1))
    return JC, FMI, RI

def evaluate_it(y, t):
    a, b, c, d = evaluate(y, t)
    return external_index(a, b, c, d, len(y))
```
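The pair counting above can be sanity-checked: a clustering identical to the ground truth up to a relabeling must score JC = FMI = 1. A self-contained sketch (`pair_confusion` is a name introduced here, mirroring `evaluate`):

```python
import numpy as np

def pair_confusion(y, t):
    # count sample pairs by agreement in predicted (y) and true (t) labels
    a = b = c = d = 0
    for i in range(len(y)):
        for j in range(i + 1, len(y)):
            same_y, same_t = y[i] == y[j], t[i] == t[j]
            if same_y and same_t:
                a += 1
            elif same_y:
                b += 1
            elif same_t:
                c += 1
            else:
                d += 1
    return a, b, c, d

# same partition, different label names
y = [0, 0, 1, 1, 2, 2]
t = [2, 2, 0, 0, 1, 1]
a, b, c, d = pair_confusion(y, t)
JC = a / (a + b + c)
FMI = np.sqrt(a**2 / ((a + b) * (a + c)))
print(JC, FMI)  # 1.0 1.0
```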
| Index | value |
|---|---|
| JC | 0.8187638512681605 |
| FMI | 0.9003627122239571 |
| RI | 0.921766004415011 |
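These hand-rolled indices can also be cross-checked against a library implementation; for instance, scikit-learn provides the adjusted Rand index, a chance-corrected relative of RI:

```python
from sklearn.metrics import adjusted_rand_score

# ARI is invariant to label permutations: identical partitions with
# swapped label names still score a perfect 1.0
print(adjusted_rand_score([0, 0, 1, 1], [1, 1, 0, 0]))  # 1.0
```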