文章目录
Hierarchical clustering: revisited
- create nested clusters
- agglomerative clustering algorithms vary in terms of how the proximity of two clusters are computed
- MIN(single link): susceptible to noise/outliers
- MAX/GROUP AVERAGE: may not work well with non-globular clusters
- CURE algorithm tries to handle both problems
- Often starts with a proximity matrix
- a type of graph-based algorithm
CURE: another hierarchical approach
- uses a number of points to represent a cluster
- Representative points are found by selecting a constant number of points from a cluster and then “shrinking” them toward the center of the cluster
- Cluster similarity is the similarity of the closest pair of representative points from different clusters
Shrinking