1. 导入需要的包:
# encoding: utf-8
from mpl_toolkits.mplot3d import Axes3D
import matplotlib.pyplot as plt
from sklearn import datasets
from sklearn.decomposition import PCA
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
2. 导入iris数据集并查看数据集基本情况:
iris = datasets.load_iris()
iris_X = iris.data
iris_y = iris.target
print iris_X.shape
print iris.feature_names
print iris.target_names
输出结果:
(150, 4)
['sepal length (cm)', 'sepal width (cm)', 'petal length (cm)', 'petal width (cm)']
['setosa' 'versicolor' 'virginica']
3. 使用PCA将4维数据降至3维,并查看降维后的前5个样本数据:
print iris_X.shape
print iris_X[0:5]
model_pca = PCA(n_components = 4)
X_pca = model_pca.fit_transform(iris_X)
prin