本系列需要记录一些scikit-learn的用法,防止遗忘
1.KMeans
sklearn.cluster.KMeans
from sklearn.cluster import KMeans
from sklearn import datasets
import numpy as np
def test1():
X = np.array([[1, 2], [1, 4], [1, 0],[10, 2], [10, 4], [10, 0]])
kmean = KMeans(n_clusters=2).fit(X)
print(kmean.labels_)
# center of each class
print(kmean.cluster_centers_)
# variance
print(kmean.inertia_)
# times
print(kmean.n_iter_)
# predict other simple
print(kmean.predict([[1,2],[10,3]]))
2.MiniBatchKMeans
sklearn.cluster.MiniBatchKMeans
此方法是基于采样进行聚类的,并不对所有的样例进行聚类,每次更新只取其中的一部分
from sklearn.cluster import MiniBatchKMeans
import numpy as np
def test2():
X = np.array([[1, 2], [1, 4], [1, 0], [10, 2], [10, 4], [10, 0]])
# batch_size 为每次更新选取的样本数量
minikmean = MiniBatchKMeans(n_clusters=2, batch_size=5).fit(X)
print(minikmean.labels_)
# 具体用法与Kmeans大致相同
K-Mean详情参考 博客