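My input feature set is in the form of a CSV file. I used
http://scikit-learn.org/stable/auto_examples/cluster/plot_kmeans_silhouette_analysis.html
to choose the number of clusters for my unlabeled data, and the highest score was for 3 clusters,
so I am using 3 clusters. Is there a way to generate an output label for each input row with the k-means clustering algorithm?
Basically, I want to print all of the input features together with the k-means result, i.e. the cluster label assigned to each row of the input CSV file.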
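One possible way to get a label per row is sketched below. It is a minimal sketch and not the approach from the linked example: the helper name `add_cluster_labels`, the output column name `cluster`, the `random_state` value, and the synthetic two-column data are all assumptions made for illustration. It relies on `KMeans.fit_predict`, which returns one integer label per input row in the same order as the rows, so the labels can be attached to the DataFrame as a new column.

```python
import numpy as np
import pandas as pd
from sklearn.cluster import KMeans


def add_cluster_labels(frame, feature_columns, n_clusters=3, random_state=0):
    """Return a copy of `frame` with an extra integer 'cluster' column.

    `add_cluster_labels` and the 'cluster' column name are hypothetical,
    chosen only for this sketch.
    """
    model = KMeans(n_clusters=n_clusters, n_init=10, random_state=random_state)
    labeled = frame.copy()
    # fit_predict returns one label per row, in the same order as the input.
    labeled["cluster"] = model.fit_predict(frame[feature_columns].to_numpy())
    return labeled


# Synthetic stand-in data (NOT the real d_features.csv): three separated blobs.
rng = np.random.default_rng(0)
centers = np.array([[0.0, 0.0], [10.0, 10.0], [-10.0, 10.0]])
points = np.vstack([c + rng.normal(scale=0.5, size=(20, 2)) for c in centers])
demo = pd.DataFrame(points, columns=["Variable1", "Variable2"])

labeled_demo = add_cluster_labels(demo, ["Variable1", "Variable2"])
# Every input feature plus the assigned cluster label, one line per row.
print(labeled_demo.to_string())
```

With the real file, the same helper could presumably be called on the DataFrame returned by `read_csv`, passing the names of the columns that were used for clustering. The labeled result can then be written back out with `DataFrame.to_csv` if a file is preferred over printing.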
Python code:
import pandas
from pandas import read_csv
names = ["Variable1", "Variable2", "Variable3", "Variable4", "Variable5", "Variable6", "Variable7", "Variable8", "Variable9", "Variable10", "Variable11", "Variable12", "InputClass"]
filename = 'C:/Users/svx/d_features.csv'
dataframe = read_csv(filename, names=names)
array = dataframe.values
X = array[:,2:11]
range_n_clusters = [2, 3, 4, 5, 6]
for n_clusters in range_n_clusters:
# Create a subplot with 1 r