一、数据获取
代码示例:
import numpy as np
import matplotlib.pyplot as plt
from sklearn.datasets import fetch_lfw_people
faces = fetch_lfw_people()
faces.data.shape
#运行结果:(13233, 2914)
faces.images.shape
#运行结果:(13233, 62, 47)
ramdom_indexes = np.random.permutation(len(faces.data)) #获得随机排列
X = faces.data[ramdom_indexes]
example_faces = X[:36,:] #获得随机36张脸
#图形可视化
def plot_digits(faces):
fig, axes = plt.subplots(6,6,figsize=(10,10),subplot_kw = {'xticks':[],'yticks':[]},
gridspec_kw = dict(hspace=0.1,wspace=0.1))
for i, ax in enumerate(axes.flat):
ax.imshow(faces[i].reshape(62,47),cmap='bone')
plt.show()
plot_digits(example_faces)
运行结果:
二、特征脸
代码示例:
from sklearn.decomposition import PCA
pca = PCA(svd_solver='randomized')
pca.fit(X)
pca.components_.shape #2914个主成分,对应2914个向量
plot_digits(pca.components_[:36,:])
运行结果: