数据降维特征选择:
过滤式:
var = VarianceThreshold(threshold=0)
data = var.fit_transform([[0, 2, 0, 3], [0, 1, 4, 3], [0, 1, 1, 3]])
print(data)
PCA
pca = PCA(n_components=0.9)
data = pca.fit_transform([[2,8,4,5],[6,3,0,8],[5,4,9,1]])
print(data)
降维案例:
算法的分类和开发流程:
数据集划分:
from sklearn.model_selection import train_test_split
from sklearn.datasets import load_iris
ls = load_iris()
x_train,x_test,y_train,t_test = train_test_split(ls.data,ls.target,test_size=0.2,random_state=22)
print(x_train)
转换器与估计器:
估计器流程: