Clustering Algorithms: KMeans/DBSCAN/DenPeak/NormalizeCut/RCC

Latest recommended article published on 2024-01-19 17:26:34

泽泽馥泽泽

Category: Clustering. Tags: kmeans dbscan rcc denpeak normalize cut

Article link: https://blog.csdn.net/Zhongsigen/article/details/83623443

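Structure of this article

Classic clustering algorithms: linear clustering (Kmeans)
Classic clustering algorithms: nonlinear clustering (DBSCAN, spectral clustering)
Emerging clustering algorithms: DenPeak, RCC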

K-means

K-means clustering is a method of vector quantization, originally from signal processing, that is popular for cluster analysis in data mining. K-means clustering aims to partition n observations into k clusters in which each observation belongs to the cluster with the nearest mean, which serves as the prototype of the cluster.

Given a set of observations $(x_{1}, x_{2}, \ldots, x_{n})$, where each observation is a d-dimensional real vector, k-means clustering aims to partition the n observations into $k (\leq n)$ sets $S = \{S_{1}, S_{2}, \ldots, S_{k}\}$ so as to minimize the within-cluster sum of squares (i.e. the variance). The objective is to find:

$\arg\min_{S}\sum_{i=1}^{k}\sum_{x \in S_{i}}\|x-\mu_{i}\|^{2}$

where $\mu_{i}$ is the mean of the points in $S_{i}$.
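To make the objective concrete, here is a minimal sketch that evaluates the within-cluster sum of squares for a given partition. The function name `within_cluster_ss`, the label encoding (integers 0..k-1), and the toy data are assumptions for illustration, not part of the original post.

```python
import numpy as np

def within_cluster_ss(X, labels, k):
    """Sum over clusters of squared distances from each point to its cluster mean."""
    total = 0.0
    for i in range(k):
        members = X[labels == i]          # points assigned to S_i
        if len(members) == 0:             # an empty cluster contributes nothing
            continue
        mu = members.mean(axis=0)         # mu_i, the mean of S_i
        total += float(((members - mu) ** 2).sum())
    return total

# Hypothetical toy data: two tight pairs of points far apart.
X = np.array([[0.0, 0.0], [0.0, 2.0], [10.0, 0.0], [10.0, 2.0]])
good = within_cluster_ss(X, np.array([0, 0, 1, 1]), k=2)  # pairs kept together
bad = within_cluster_ss(X, np.array([0, 1, 0, 1]), k=2)   # pairs split apart
print(good, bad)
```

The partition that keeps nearby points together gives the smaller value, which is what the arg min over $S$ selects.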

[Figure: K-means pseudocode]
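The usual way to approach this objective is the alternating assign/update iteration (often called Lloyd's algorithm). The sketch below is one plain NumPy version of it and may not match the post's pseudocode figure step for step. The initialization (k distinct data points chosen at random), the `n_iter` cap, and the `seed` parameter are assumptions for illustration.

```python
import numpy as np

def assign(X, centers):
    """Index of the nearest center (squared Euclidean distance) for each point."""
    dists = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return dists.argmin(axis=1)

def kmeans(X, k, n_iter=100, seed=0):
    rng = np.random.default_rng(seed)
    # Assumed initialization: pick k distinct observations as starting centers.
    centers = X[rng.choice(len(X), size=k, replace=False)].copy()
    for _ in range(n_iter):
        labels = assign(X, centers)               # assignment step
        new_centers = centers.copy()
        for i in range(k):                        # update step
            members = X[labels == i]
            if len(members) > 0:                  # keep old center if cluster is empty
                new_centers[i] = members.mean(axis=0)
        if np.allclose(new_centers, centers):     # centers stopped moving
            break
        centers = new_centers
    return assign(X, centers), centers

# Hypothetical toy data: two well-separated groups of three points.
X = np.array([[0.0, 0.0], [0.0, 1.0], [1.0, 0.0],
              [10.0, 10.0], [10.0, 11.0], [11.0, 10.0]])
labels, centers = kmeans(X, k=2)
print(labels, centers)
```

This iteration only reaches a local minimum of the objective in general, so in practice it is often restarted from several initializations and the lowest-cost result is kept.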

DBSCAN

Density-based spatial clustering of applications with noise (DBSCAN) is a density-based data clustering algorithm: given a set of points in some space, it groups together points that are closely packed (points with many nearby neighbors) and marks as outliers the points that lie alone in low-density regions (whose nearest neighbors are too far away). DBSCAN is one of the most common clustering algorithms and among the most cited in the scientific literature.
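The notion of "many nearby neighbors" can be illustrated with a simple neighbor count. The sketch below is only the density test, not a full DBSCAN implementation; the names `neighbor_counts`, `eps`, and `min_pts`, their values, and the choice to count a point as its own neighbor are assumptions for illustration.

```python
import numpy as np

def neighbor_counts(X, eps):
    """For each point, count the points within distance eps (the point itself included)."""
    diffs = X[:, None, :] - X[None, :, :]
    dists = np.sqrt((diffs ** 2).sum(axis=2))   # pairwise Euclidean distances
    return (dists <= eps).sum(axis=1)

# Hypothetical toy data: four closely packed points and one isolated point.
X = np.array([[0.0, 0.0], [0.0, 0.5], [0.5, 0.0], [0.5, 0.5], [8.0, 8.0]])
eps, min_pts = 1.0, 3                            # assumed threshold values
counts = neighbor_counts(X, eps)
dense = counts >= min_pts                        # True where the neighborhood is dense
print(counts, dense)
```

The four packed points pass the density test, while the isolated point does not and would be a candidate outlier.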

Consider a set of points in some space to be clustered. For the purpose of DBSCAN clustering, the points are classified as core points, density-reachable points and outliers, as follows:

A point p is a core point if at least minPts points are within distance $\varepsilon$ ( $\varepsilon$
