Weka Explorer

最新推荐文章于 2021-02-22 14:30:06 发布

nomad2

最新推荐文章于 2021-02-22 14:30:06 发布

阅读量2k

点赞数

分类专栏： Data Mining 文章标签： classification tools tree 算法测试

本文链接：https://blog.csdn.net/nomad2/article/details/7233565

版权

Data Mining 专栏收录该内容

47 篇文章 0 订阅

订阅专栏

refer to 《Data Mining,Practical Machine Learning Tools and Techniques 2nd》chapter 10

1. weka explorer 界面如下:

功能简介：

1） preprocess: 选择数据集，并以多种方式进行修改

2）分类： train learning schemes that perform classification or regression and evaluate them

例如： IB1是一个基本的基于实例的分类器。

3）聚类：learn clusters for the dataset.

4）关联：学习数据的关联规则并对其评估

5）选择属性：在数据集中选择最相关的部分

6）可视化：View different two-dimensional plots of the data and interact with them.

2. UserClassifier

选择segment-challenge.arff，测试数据选择segment-test.arff，XY的选择如下图所示，用来选择的标准是找出一个能使类与类分离的尽可能干净的X和Y的组合。

submit后，查看产生的tree，如下图所示，

继续增加节点，到几乎所有节点都是纯的为止。自己构建太累，点击任何一个节点都可以使用系统的分类法构造子树。

3. 分类器

使用k-means分类Iris数据，

4. 关联规则挖掘

使用Apriori算法来挖掘天气数据关联规则，结果如下，

Best rules found:

 1. humidity=normal windy=FALSE 4 ==> play=yes 4    conf:(1)
 2. temperature=cool 4 ==> humidity=normal 4    conf:(1)
 3. outlook=overcast 4 ==> play=yes 4    conf:(1)
 4. temperature=cool play=yes 3 ==> humidity=normal 3    conf:(1)
 5. outlook=rainy windy=FALSE 3 ==> play=yes 3    conf:(1)
 6. outlook=rainy play=yes 3 ==> windy=FALSE 3    conf:(1)
 7. outlook=sunny humidity=high 3 ==> play=no 3    conf:(1)
 8. outlook=sunny play=no 3 ==> humidity=high 3    conf:(1)
 9. temperature=cool windy=FALSE 2 ==> humidity=normal play=yes 2    conf:(1)
10. temperature=cool humidity=normal windy=FALSE 2 ==> play=yes 2    conf:(1)