DMX - SQL SERVER 数据挖掘聚类

最新推荐文章于 2023-05-23 18:21:14 发布

incognito007

最新推荐文章于 2023-05-23 18:21:14 发布

阅读量2.1k

点赞数

分类专栏： DMX

本文链接：https://blog.csdn.net/sfrankzhang/article/details/8190069

版权

本文探讨了SQL SERVER数据挖掘中的聚类算法，包括SCALEABLE EM、NO SCALEABLE EM、SCALEABLE KM和NO SCALEABLE KM四种，并通过案例对比了它们的性能。根据测试结果，NO SCALEABLE EM算法在Case Likelihood指标上表现最优。

摘要由CSDN通过智能技术生成

聚类有如下特征：

如果列标为，

1，PREDICT，预测可选；

2， INPUT，预测不可选；

3，PREDICT ONLY，TRAINING忽略。

一共有四种算法：

1，SCALEABLE EM

2， NO SCALEABLE EM

3， SCALEABLE KM

4， NO SCALEABLE KM

下面一个例子比较四种算法

create mining structure [Clustering Method]
(
[Age] long discretized(automatic,10),
[Bike Buyer] long discrete,
[Commute Distance] text discrete,
[Customer Key] long key,
[Education] text discrete,
[Gender] text discrete,
[House Owner Flag] text discrete,
[Marital Status] text discrete,
[Number Cars Owned] long discrete,
[Number Children At Home] long discrete,
[Occupation] text discrete,
[Region] text discrete,
[Total Children] long discrete,
[Yearly Income] double continuous
)

alter mining structure [Clustering Method]
add mining model [Clutering_SEM]
using microsoft_clustering
(CLUSTERING

最低0.47元/天解锁文章

incognito007

关注

0
点赞
踩
2

收藏

觉得还不错? 一键收藏
3
评论
DMX - SQL SERVER 数据挖掘聚类

聚类有如下特征：如果列标为，1，PREDICT，预测可选；2， INPUT，预测不可选；3，PREDICT ONLY，TRAINING忽略。一共有四种算法：1，SCALEABLE EM2， NO SCALEABLE EM3， SCALEABLE KM4， NO SCALEABLE KM 下面一个例子比较四种算法create mining struct
复制链接

扫一扫

专栏目录