Hierarchical Clustering

phoenix-bai

Clustering, in one sentence, is the extraction of natural groupings of similar data objects.
A few general ideas come up repeatedly in discussions of clustering:

The clusters should be naturally occurring in data.
The clustering should discover hidden patterns in the data.
Data points within the same cluster should be similar.
Data points in two different clusters should be dissimilar.

Common algorithms used for clustering include K-Means, DBSCAN, and Gaussian Mixture Models.

Hierarchical Clustering

Hierarchical clustering builds on these clustering techniques to find a hierarchy of clusters rather than a single flat partition. The hierarchy forms a tree structure called a dendrogram.

Put another way, hierarchical clustering is the hierarchical decomposition of the data based on group similarities.
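To make the tree structure concrete, here is a minimal sketch of a dendrogram as a plain binary tree. The `Node` class, the `leaves` and `cut` helpers, the labels, and the merge heights are all hypothetical and hand-picked for illustration. Cutting the tree at a chosen height is one common way to turn the hierarchy back into flat clusters.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Node:
    """One dendrogram node: a leaf carries a label, an internal node joins two subtrees."""
    height: float = 0.0            # dissimilarity at which the two subtrees were joined
    label: Optional[str] = None    # set only on leaves
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def leaves(node):
    """Collect the labels of all data points under a node."""
    if node.label is not None:
        return [node.label]
    return leaves(node.left) + leaves(node.right)


def cut(node, threshold):
    """Flat clusters obtained by cutting the tree at the given height."""
    if node.label is not None or node.height <= threshold:
        return [leaves(node)]
    return cut(node.left, threshold) + cut(node.right, threshold)


# Hand-built example (hypothetical heights): a and b join at 1.0,
# c and d join at 2.0, and the two groups join at 5.0.
tree = Node(
    height=5.0,
    left=Node(height=1.0, left=Node(label="a"), right=Node(label="b")),
    right=Node(height=2.0, left=Node(label="c"), right=Node(label="d")),
)

print(cut(tree, 3.0))
print(cut(tree, 1.5))
```

Lowering the threshold yields more, smaller clusters. Raising it above the root height returns everything as one cluster.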

Finding hierarchical clusters

There are two top-level methods for finding these hierarchical clusters:

Agglomerative clustering uses a bottom-up approach, wherein each data point starts in its own cluster. These clusters are then joined greedily: at each step, the two most similar clusters are merged into one.
Divisive clustering uses a top-down approach, wherein all data points start in the same cluster. You can then use a clustering algorithm like K-Means (with K = 2) to divide that cluster into two clusters. Each resulting cluster is divided in two again, and so on, until you reach the desired number of clusters.
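The two strategies above can be sketched in a few lines of plain Python. This is an illustration under several assumptions the text does not fix: Euclidean distance between points, single linkage for the agglomerative merge (cluster distance is the distance between the closest pair of members), a deterministic farthest-pair initialisation for the K = 2 split, and splitting the largest cluster first. The names `agglomerative`, `divisive`, and `two_means` are hypothetical.

```python
import math
from itertools import combinations


def euclidean(a, b):
    """Straight-line distance between two equal-length points."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def agglomerative(points, n_clusters):
    """Bottom-up: start from singletons and greedily merge the closest pair."""
    clusters = [[p] for p in points]
    while len(clusters) > n_clusters:
        # Assumption: single linkage, i.e. the closest pair of members decides.
        i, j = min(
            combinations(range(len(clusters)), 2),
            key=lambda ij: min(
                euclidean(a, b)
                for a in clusters[ij[0]]
                for b in clusters[ij[1]]
            ),
        )
        clusters[i].extend(clusters[j])  # i < j, so deleting j keeps i valid
        del clusters[j]
    return clusters


def mean(points):
    """Coordinate-wise mean of a non-empty list of points."""
    return tuple(sum(coord) / len(points) for coord in zip(*points))


def two_means(points, n_iter=10):
    """Split one cluster in two with a small K-Means loop (K = 2)."""
    # Assumption: seed the centroids with the two farthest-apart points.
    c0, c1 = max(combinations(points, 2), key=lambda ab: euclidean(*ab))
    left, right = list(points), []
    for _ in range(n_iter):
        left = [p for p in points if euclidean(p, c0) <= euclidean(p, c1)]
        right = [p for p in points if euclidean(p, c0) > euclidean(p, c1)]
        if not left or not right:
            break
        c0, c1 = mean(left), mean(right)
    return left, right


def divisive(points, n_clusters):
    """Top-down: start from one cluster and keep bisecting."""
    clusters = [list(points)]
    while len(clusters) < n_clusters:
        # Assumption: split the largest remaining cluster next.
        largest = max(clusters, key=len)
        if len(largest) < 2:
            break
        left, right = two_means(largest)
        if not left or not right:
            break  # all points identical, nothing left to split
        clusters.remove(largest)
        clusters += [left, right]
    return clusters


data = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
print(agglomerative(data, 2))
print(divisive(data, 2))
```

The agglomerative version re-scans every pair of clusters at each step, which is fine for a handful of points but far too slow for real data. Library implementations cache and update the distances instead.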

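Both of these approaches need a way to measure how similar data points are. Agglomerative clustering in particular is usually driven by a pairwise similarity (or distance) matrix over all of the data points, computed with a measure such as Euclidean, cosine, or Jaccard distance.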
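As a rough sketch of what such a matrix looks like, the snippet below computes cosine distance for numeric vectors and Jaccard distance for sets, then fills a symmetric pairwise matrix. The function names are hypothetical, and the cosine version assumes no vector is all zeros.

```python
import math


def cosine_distance(u, v):
    """1 minus cosine similarity; assumes neither vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)


def jaccard_distance(s, t):
    """1 minus (size of intersection / size of union) for two sets."""
    s, t = set(s), set(t)
    union = s | t
    if not union:
        return 0.0  # convention chosen here: two empty sets are identical
    return 1.0 - len(s & t) / len(union)


def distance_matrix(items, dist):
    """Symmetric n x n matrix of pairwise distances with a zero diagonal."""
    n = len(items)
    matrix = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            matrix[i][j] = matrix[j][i] = dist(items[i], items[j])
    return matrix


vectors = [(1, 0), (0, 1), (2, 0)]
print(distance_matrix(vectors, cosine_distance))

baskets = [{"a", "b", "c"}, {"b", "c", "d"}, {"x"}]
print(distance_matrix(baskets, jaccard_distance))
```

Cosine distance ignores vector length, so `(1, 0)` and `(2, 0)` count as identical. Jaccard distance only looks at set membership, which suits data such as tags or item baskets.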
