高斯分布可视化

最新推荐文章于 2023-02-01 15:54:02 发布

KPer_Yang

最新推荐文章于 2023-02-01 15:54:02 发布

阅读量525

点赞数 2

分类专栏：数学文章标签：机器学习 python 算法

本文链接：https://blog.csdn.net/KPer_Yang/article/details/126511506

版权

数学专栏收录该内容

13 篇文章 5 订阅

订阅专栏

高斯分布

基本概念

服从一元高斯分布的一组样本 $X=(x_1,x_2,\dots,x_n)$ ，其中每一个样本都是一个随机数值。但是在多元高斯分布中，每一个样本不是一个数值，而是多维的随机向量，每一个样本都是p维：
$\pmb{x}=\begin{bmatrix} x_1 \\ x_2 \\ \vdots \\ x_p \end{bmatrix}$
假设有n个样本，每一个都是p维，那么可以表示成：
$\pmb{X}=\begin{bmatrix} x_{11} & x_{12} & \cdots & x_{1p} \\ x_{21} & x_{22} & \cdots & x_{2p} \\ \vdots & & & \vdots \\ x_{n1} & x_{n2} & \cdots & x_{np} \end{bmatrix}$
如何去表示这样的样本矩阵？使用多维高斯分布！！

均值参数表示成：
$\pmb{\mu}=\begin{bmatrix} \mu_1 \\ \mu_2 \\ \vdots \\ \mu_p \end{bmatrix}$
多维高斯分布的方差参数表示成协方差矩阵（协方差矩阵表示里面不同随机变量的相关性）：
$\pmb{\Sigma}=\begin{bmatrix} \sigma_{11} & \sigma_{12} & \cdots & \sigma_{1p} \\ \sigma_{21} & \sigma_{22} & \cdots & \sigma_{2p} \\ \vdots & & & \vdots \\ \sigma_{n1} & x_{n2} & \cdots & \sigma_{pp} \end{bmatrix}$
概率密度函数：
$p(x|\theta)=\frac{1}{(2\pi)^{\frac{p}{2}}|\Sigma|^{\frac{1}{2}}}exp[-\frac{1}{2}(x-\mu)^T\Sigma^{-1}(x-\mu)]$

二元高斯分布的实例：

可以看到，不同协方差矩阵和不同均值的高斯分布在形状上是不一样的。而且呈椭圆分布，越接近椭圆中心，样本密度越大；反之越稀疏。而且，协方差矩阵的特征值与椭圆的长短轴的长度有关系，而协方差矩阵的特征向量和椭圆的长短轴的方向有关：


# -*- coding: utf-8 -*-
# @Use     : 
# @Time    : 2022/8/24 19:22
# @FileName: Gaussian.py
# @Software: PyCharm

import numpy as np
import matplotlib.pyplot as plt

mean_1 = np.array([0, 0])
conv_1 = np.array([[1, 0],
                  [0, 1]])

mean_2 = np.array([0, -5])
conv_2 = np.array([[4, 0],
                  [0, 0.25]])

mean_3 = np.array([4, 4])
conv_3 = np.array([[4, -3],
                  [-3, 0.25]])

x_1, y_1 = np.random.multivariate_normal(mean=mean_1, cov=conv_1, size=2000).T
x_2, y_2 = np.random.multivariate_normal(mean=mean_2, cov=conv_2, size=2000).T
x_3, y_3 = np.random.multivariate_normal(mean=mean_3, cov=conv_3, size=2000).T

plt.plot(x_1, y_1, 'ro', alpha=0.05)
plt.plot(x_2, y_2, 'bo', alpha=0.05)
plt.plot(x_3, y_3, 'go', alpha=0.05)

plt.gca().axes.set_xlim(-10, 10)
plt.gca().axes.set_ylim(-10, 10)
plt.grid(ls='--')
plt.show()

#求协方差矩阵的特征值和特征向量
evalue. evevtor = linalg.eig(conv1)
evalue. evevtor = linalg.eig(conv2)
evalue. evevtor = linalg.eig(conv3)