先看help结果,及返回的结果,就是训练用的数据
make_classification(n_samples=100, n_features=20, n_informative=2,
n_redundant=2, n_repeated=0, n_classes=2, n_clusters_per_class=2, weights=None,
flip_y=0.01, class_sep=1.0, hypercube=True, shift=0.0, scale=1.0, shuffle=True, random_state=None)
X : array of shape [n_samples, n_features]
The generated samples.
y : array of shape [n_samples]
The integer labels for class membership of each sample.
y是0~n_classes-1单个数字的集合,分别对应X的sample
X几个特征就有几列,知道三个参数就可以干活了。
>>> X, y = make_classification(n_samples=100, n_features=5, n_clas