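Using the Adult dataset as an example
Rename adult.data.txt so that it has the .csv extension, open adult.data.csv in Weka Explorer, and save it as an ARFF file.
Doing the same with adult.test.txt produces an error. The file begins like this: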
|1x3 Cross validator
25, Private, 226802, 11th, 7, Never-married, Machine-op-inspct, Own-child, Black, Male, 0, 0, 40, United-States, <=50K.
38, Private, 89814, HS-grad, 9, Married-civ-spouse, Farming-fishing, Husband, White, Male, 0, 0, 50, United-States, <=50K.
28, Local-gov, 336951, Assoc-acdm, 12, Married-civ-spouse, Protective-serv, Husband, White, Male, 0, 0, 40, United-States, >50K.
44, Private, 160323, Some-college, 10, Married-civ-spouse, Machine-op-inspct, Husband, Black, Male, 7688, 0, 40, United-States, >50K.
18, ?, 103497, Some-college, 10, Never-married, ?, Own-child, White, Female, 0, 0, 30, United-States, <=50K.
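The problem is actually in the first line, which is not a comma-separated record like the rows after it.
Delete the first line and paste the first line of adult.data.csv in its place. Weka parses the first line of a CSV file as the attribute names, so every field in it must be unique.
That solves the problem.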
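
The same repair can be scripted instead of done by hand. The following is a minimal sketch, not part of the original workflow: the function names `repair_test_csv` and `repair_files` are hypothetical, and it assumes the offending first line starts with `|`, as in the excerpt above. It also checks that the header fields are unique before writing them out.

```python
from pathlib import Path


def repair_test_csv(test_text: str, header_line: str) -> str:
    """Drop a leading '|' line and prepend header_line (hypothetical helper)."""
    # Weka reads the first CSV line as attribute names, so they must be unique.
    fields = [f.strip() for f in header_line.split(",")]
    if len(set(fields)) != len(fields):
        raise ValueError("header fields must be unique")

    lines = test_text.splitlines()
    # Assumption: the non-data line looks like "|1x3 Cross validator".
    if lines and lines[0].startswith("|"):
        lines = lines[1:]
    return "\n".join([header_line.strip()] + lines) + "\n"


def repair_files(data_path: str, test_path: str) -> None:
    """Copy the first line of the data file onto the top of the test file."""
    header = Path(data_path).read_text().splitlines()[0]
    fixed = repair_test_csv(Path(test_path).read_text(), header)
    Path(test_path).write_text(fixed)
```

Calling `repair_files("adult.data.csv", "adult.test.csv")` would rewrite the test file in place, so it is worth keeping a copy of the original first.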