Data Mining and Machine Learning Notes 1


Machine Learning Taxonomy
[Figures: taxonomy of machine learning methods]

1 PCA

Consider linear combinations:
$$
\begin{aligned}
Y_1 &= a_{11}X_1 + a_{12}X_2 + \dots + a_{1p}X_p = \mathrm{a}_1^\mathsf{T}X \\
Y_2 &= a_{21}X_1 + a_{22}X_2 + \dots + a_{2p}X_p = \mathrm{a}_2^\mathsf{T}X \\
&\;\;\vdots \\
Y_p &= a_{p1}X_1 + a_{p2}X_2 + \dots + a_{pp}X_p = \mathrm{a}_p^\mathsf{T}X
\end{aligned}
$$
PCA

  • The linear combinations $Y_1, Y_2, \dots, Y_p$ are the principal components.
  • $\mathrm{a}_j$ is the eigenvector associated with the $j^{th}$ principal component.
  • $a_{j1}, \dots, a_{jp}$ are the loadings of the $j^{th}$ principal component. The loadings make up the principal component loading vector $\mathrm{a}_j = (a_{j1}, \dots, a_{jp})^\mathsf{T}$.
  • Score
    $y_{ij} = a_{j1}x_{i1} + a_{j2}x_{i2} + \dots + a_{jp}x_{ip}$
    gives the coordinate of observation $i$ in the new principal-component coordinate system.
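The definitions above can be sketched numerically: estimate the covariance matrix, take its eigenvectors as the loading vectors $\mathrm{a}_j$, and project the centred data to get the scores. A minimal sketch, assuming numpy is available; the data values are hypothetical:

```python
import numpy as np

# Toy data: n = 6 observations, p = 3 variables (hypothetical values).
X = np.array([[2.5, 2.4, 0.5],
              [0.5, 0.7, 1.2],
              [2.2, 2.9, 0.3],
              [1.9, 2.2, 0.7],
              [3.1, 3.0, 0.1],
              [2.3, 2.7, 0.4]])

# Centre the data and estimate the covariance matrix Sigma.
Xc = X - X.mean(axis=0)
Sigma = np.cov(Xc, rowvar=False)

# Eigendecomposition: columns of A are the loading vectors a_j,
# lam holds the eigenvalues lambda_j, sorted in decreasing order.
lam, A = np.linalg.eigh(Sigma)
order = np.argsort(lam)[::-1]
lam, A = lam[order], A[:, order]

# Scores: y_ij = a_j^T x_i, i.e. each observation expressed in the
# new principal-component coordinate system.
Y = Xc @ A
print(lam)    # eigenvalues = variances of Y_1, ..., Y_p
print(Y[:2])  # scores of the first two observations
```

`np.linalg.eigh` is used rather than `eig` because the covariance matrix is symmetric, which guarantees real eigenvalues and orthonormal eigenvectors.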

Properties

  • $Y_1, Y_2, \dots, Y_p$ are pairwise uncorrelated: $Var(Y) = \mathrm{diag}(\lambda_1, \dots, \lambda_p) = \Lambda$, where $\lambda_j$ is the $j^{th}$ eigenvalue of $\Sigma$ and $Var(Y_j) = \mathrm{a}_j^\mathsf{T}\Sigma\,\mathrm{a}_j$.
  • The total variance is preserved under the principal component transformation: $\sum_{j=1}^{p} Var(Y_j) = \sum_{j=1}^{p} Var(X_j)$.
  • The first $k$ principal components account for the proportion $\frac{\sum_{j=1}^{k}\lambda_j}{\sum_{j=1}^{p}\lambda_j}$ of the total variance.
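These three properties can be checked numerically on simulated data. A minimal sketch, assuming numpy is available; the mixing matrix below is hypothetical and only serves to make the variables correlated:

```python
import numpy as np

rng = np.random.default_rng(0)
# Hypothetical correlated data: n = 500 observations, p = 4 variables.
X = rng.standard_normal((500, 4)) @ np.array([[2.0, 0.5, 0.0, 0.0],
                                              [0.5, 1.0, 0.3, 0.0],
                                              [0.0, 0.3, 1.0, 0.2],
                                              [0.0, 0.0, 0.2, 0.5]])
Sigma = np.cov(X, rowvar=False)
lam, A = np.linalg.eigh(Sigma)
lam, A = lam[::-1], A[:, ::-1]          # sort eigenvalues in decreasing order
Y = (X - X.mean(axis=0)) @ A            # principal component scores

# 1) Var(Y) is diagonal: the components are pairwise uncorrelated.
assert np.allclose(np.cov(Y, rowvar=False), np.diag(lam), atol=1e-8)

# 2) Total variance is preserved: sum_j Var(Y_j) = sum_j Var(X_j) = tr(Sigma).
assert np.isclose(lam.sum(), np.trace(Sigma))

# 3) Proportion of total variance captured by the first k components.
k = 2
print(lam[:k].sum() / lam.sum())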

PCA practice

  • Proportion of variation: keep enough components to explain a chosen fraction (e.g. 80%) of the total variance.
  • Cattell's method: look for the "elbow" in the scree plot of the eigenvalues.
  • Kaiser's method: keep components whose eigenvalue exceeds the average eigenvalue (greater than 1 when PCA is done on the correlation matrix).

For detailed steps, see https://blog.csdn.net/MINGRAN_JIA/article/details/123242755
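The selection rules above can be sketched as follows, assuming numpy is available; the eigenvalues below are hypothetical values for a correlation matrix with $p = 6$:

```python
import numpy as np

# Hypothetical eigenvalues of a correlation matrix (p = 6).
lam = np.array([2.8, 1.4, 0.9, 0.5, 0.3, 0.1])

# 1) Proportion of variation: keep enough components to explain, say, 80%.
cum = np.cumsum(lam) / lam.sum()
k_prop = int(np.searchsorted(cum, 0.80) + 1)

# 2) Cattell's method: look for the "elbow" in the scree plot of lam
#    (a visual judgement; here we just print the values to inspect).
print("scree values:", lam)

# 3) Kaiser's method: keep components whose eigenvalue exceeds the
#    average eigenvalue (> 1 when PCA is done on the correlation matrix).
k_kaiser = int((lam > lam.mean()).sum())

print(k_prop, k_kaiser)  # 3 components by proportion, 2 by Kaiser
```

Note that the rules need not agree; in practice they are used together as rough guides rather than hard cutoffs.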

PCA biplot
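A biplot overlays the scores of the observations on the first two principal components with the loading vectors of the original variables drawn as arrows from the origin. A minimal sketch, assuming numpy and matplotlib are available; the data and variable names are hypothetical:

```python
import numpy as np
import matplotlib
matplotlib.use("Agg")  # headless backend so the sketch runs anywhere
import matplotlib.pyplot as plt

# Toy data: n = 6 observations, p = 3 variables (hypothetical values).
X = np.array([[2.5, 2.4, 0.5],
              [0.5, 0.7, 1.2],
              [2.2, 2.9, 0.3],
              [1.9, 2.2, 0.7],
              [3.1, 3.0, 0.1],
              [2.3, 2.7, 0.4]])
Xc = X - X.mean(axis=0)
lam, A = np.linalg.eigh(np.cov(Xc, rowvar=False))
lam, A = lam[::-1], A[:, ::-1]          # decreasing eigenvalue order
Y = Xc @ A                              # scores

# Biplot: observation scores on PC1/PC2 plus the loading vectors
# of the original variables as arrows from the origin.
fig, ax = plt.subplots()
ax.scatter(Y[:, 0], Y[:, 1])
for j, name in enumerate(["X1", "X2", "X3"]):
    ax.arrow(0, 0, A[j, 0], A[j, 1], head_width=0.03, color="red")
    ax.text(A[j, 0] * 1.1, A[j, 1] * 1.1, name)
ax.set_xlabel("PC1")
ax.set_ylabel("PC2")
fig.savefig("pca_biplot.png")
```

Variables whose arrows point in similar directions are positively correlated, and observations plotted far along an arrow score highly on that variable.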
