本文将主成分分析(Principal Component Analysis)应用于Iris 数据集。
有关此数据集的详细信息,请参见 此处 。![59aed0bd3586c1beb9a3feb6b5b67a69.png](https://i-blog.csdnimg.cn/blog_migrate/5a48d6a75b9310ca034b5af7bb959245.png)
print(__doc__)# 代码来源:Gaël Varoquaux# 许可证:BSD 3 clauseimport numpy as npimport matplotlib.pyplot as pltfrom mpl_toolkits.mplot3d import Axes3Dfrom sklearn import decompositionfrom sklearn import datasets
np.random.seed(5)
centers = [[1, 1], [-1, -1], [1, -1]]
iris = datasets.load_iris()
X = iris.data
y = iris.target
fig = plt.figure(1, figsize=(4, 3))
plt.clf()
ax = Axes3D(fig, rect=[0, 0, .95, 1], elev=48, azim=134)
plt.cla()
pca = decomposition.PCA(n_components=3)
pca.fit(X)
X = pca.transform(X)for name, label in [('Setosa', 0), ('Versicolour', 1), ('Virginica', 2)]:
ax.text3D(X[y == label, 0].mean(),
X[y == label, 1].mean() + 1.5,
X[y == label, 2].mean(), name,
horizontalalignment='center',
bbox=dict(alpha=.5, edgecolor='w', facecolor='w'))# 重新排序标签,使得颜色与聚类结果相匹配
y = np.choose(y, [1, 2, 0]).astype(np.float)
ax.scatter(X[:, 0], X[:, 1], X[:, 2], c=y, cmap=plt.cm.nipy_spectral,
edgecolor='k')
ax.w_xaxis.set_ticklabels([])
ax.w_yaxis.set_ticklabels([])
ax.w_zaxis.set_ticklabels([])
plt.show()
脚本的总运行时间:( 0 分 0.338 秒)
估计的内存使用量: 8 MB
![b2e63a8fd5f462d046387cd7e5faac1d.png](https://i-blog.csdnimg.cn/blog_migrate/7e0b450f6f3b3bc6bde81e92cac6c719.png)
下载python源代码:plot_ica_vs_pca.py
下载Jupyter notebook源代码:plot_ica_vs_pca.ipynb
![ac230102e7367bfea805a581a863f207.png](https://i-blog.csdnimg.cn/blog_migrate/6de90fde801edfe6135df224f8aba38b.png)
![5f56de1d0ca884ae27a36a167a4b2210.png](https://i-blog.csdnimg.cn/blog_migrate/83858781eae4a6bd7ef319d5ca0b3d1e.png)
![fac361425486fd3ab129e7eb59383ef7.png](https://i-blog.csdnimg.cn/blog_migrate/2151c6f9d38e2bc6effc134a0b51d2a6.png)