Paper Appreciation: A Linear Assignment Clustering Algorithm Based on the Least Similar Cluster Representatives

老实人小李

Column: Paper Appreciation. Tags: clustering algorithms

Article link: https://blog.csdn.net/weixin_43660703/article/details/108467384

This post lists the paper's main formulas; it is best read alongside the explanation video.

Similarity and Dissimilarity Coefficients: Definitions

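We start with the definitions of the two coefficients. Similarity and dissimilarity coefficients measure how alike or how different two data are. Throughout, $a_{ki}$ denotes the value of attribute $k$ for datum $i$, and $m$ is the number of attributes.
A typical dissimilarity coefficient is the Minkowski metric:
$d_{ij}=\bigg[\sum_{k=1}^m|a_{ki}-a_{kj}|^q\bigg]^{\frac{1}{q}}$
where $q > 0$; $q=1$ gives the Manhattan distance and $q=2$ the Euclidean distance.
The similarity coefficient is defined as
$s_{ij}=\frac{\sum_{k=1}^{m}a_{ki}a_{kj}}{\sum_{k=1}^{m}(a_{ki}+a_{kj}-a_{ki}a_{kj})}$
For binary (0/1) attributes this is the Jaccard coefficient: the number of attributes shared by $i$ and $j$ divided by the number of attributes present in at least one of them.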
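To make the two coefficients concrete, here is a minimal Python sketch of both formulas. The function names, the tiny binary data set, and the choice to return 0 when both vectors are all zeros are illustrative assumptions, not part of the paper. Each row of `data` is one datum and each column one attribute, so `data[i][k]` plays the role of $a_{ki}$.

```python
def minkowski(a, b, q=2):
    """Minkowski distance d_ij between two equal-length attribute vectors."""
    return sum(abs(x - y) ** q for x, y in zip(a, b)) ** (1.0 / q)


def similarity(a, b):
    """Similarity coefficient s_ij (the Jaccard coefficient for 0/1 data)."""
    num = sum(x * y for x, y in zip(a, b))
    den = sum(x + y - x * y for x, y in zip(a, b))
    # Assumption: a pair of all-zero vectors is given similarity 0.
    return num / den if den else 0.0


# Hypothetical binary data: rows are data, columns are attributes.
data = [
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 1, 1],
]

print(minkowski(data[0], data[1], q=1))  # 1.0
print(similarity(data[0], data[1]))      # 0.5
print(similarity(data[0], data[2]))      # 0.0
```

Data 0 and 1 share one attribute out of the two that appear in either, hence 0.5; data 0 and 2 share none, hence 0.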

Cluster Representatives: Choosing the Representatives

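Method 1:
$\{r_1,r_2\}=\arg\min_{(i,j)}s_{ij}$
$r_k=\arg\min_{i\notin\{r_1,r_2,...,r_{k-1}\}}\sum_{j=1}^{k-1}s_{ir_j},\qquad k=3,4,...,p$
where $r_k$ is the index of the datum chosen as the representative of the $k$-th cluster. The first two representatives are the least similar pair of data; each later representative is the datum that is least similar, in total, to the representatives already chosen.
Method 2:
$\{r_1,r_2,...,r_p\}=\arg\min\bigg\{\sum_{i=1}^p\sum_{j<i}s_{r_ir_j}\;\bigg|\;r_i,r_j\in\{1,2,...,n\}\bigg\}$
Here all $p$ representatives are chosen jointly so that the total pairwise similarity among them is minimal.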
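The two selection rules can be sketched in Python as follows. This is an illustration under assumptions: the function names and the 5×5 similarity matrix `s` are made up, indices are 0-based rather than 1-based, and Method 2 is implemented by plain enumeration of all $p$-subsets, which is only practical for small $n$.

```python
from itertools import combinations


def representatives_greedy(s, p):
    """Method 1: least similar pair first, then add one representative at a time."""
    n = len(s)
    r1, r2 = min(combinations(range(n), 2), key=lambda ij: s[ij[0]][ij[1]])
    reps = [r1, r2]
    while len(reps) < p:
        candidates = [i for i in range(n) if i not in reps]
        # Pick the datum with the smallest total similarity to the chosen representatives.
        reps.append(min(candidates, key=lambda i: sum(s[i][r] for r in reps)))
    return reps


def representatives_exhaustive(s, p):
    """Method 2: the p-subset with the smallest total pairwise similarity."""
    def total(subset):
        return sum(s[i][j] for i, j in combinations(subset, 2))
    return list(min(combinations(range(len(s)), p), key=total))


# Hypothetical symmetric similarity matrix for n = 5 data.
s = [
    [1.0, 0.8, 0.1, 0.2, 0.0],
    [0.8, 1.0, 0.2, 0.1, 0.1],
    [0.1, 0.2, 1.0, 0.7, 0.3],
    [0.2, 0.1, 0.7, 1.0, 0.4],
    [0.0, 0.1, 0.3, 0.4, 1.0],
]

print(representatives_greedy(s, 3))      # [0, 4, 2]
print(representatives_exhaustive(s, 3))  # [0, 2, 4]
```

On this matrix both methods select the same set of representatives, but that is not guaranteed in general: the greedy rule commits to the first pair, while the exhaustive rule compares whole subsets.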

Linear Assignment Model

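$\text{Maximize}\quad\sum_{i=1}^n\sum_{k=1}^ps_{ir_k}x_{ik}\quad\text{or}\quad\text{minimize}\quad\sum_{i=1}^n\sum_{k=1}^pd_{ir_k}x_{ik}$
$\text{subject to}\quad\sum_{k=1}^px_{ik}=1\qquad i=1,2,...,n$
$\sum_{i=1}^nx_{ik}\le u\qquad k=1,2,...,p$
$x_{ik}\ge0\qquad i=1,2,...,n;\quad k=1,2,...,p$
where $x_{ik}$ is a binary decision variable ($x_{ik}=1$ if datum $i$ is assigned to cluster $k$, and 0 otherwise), $r_k$ is the representative of cluster $k$, and $u$ is the upper bound on the number of data per cluster. The first constraint assigns every datum to exactly one cluster; the second limits each cluster to at most $u$ data. Because the constraints have the structure of a transportation problem, the relaxation $x_{ik}\ge0$ still has a 0/1 optimal solution when $u$ is an integer.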
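To see the objective and constraints at work, the sketch below solves a tiny instance by enumerating every assignment. It reads the model as: every datum belongs to exactly one cluster and no cluster holds more than $u$ data, which is the reading consistent with Step 5 of the algorithm. The function name, the similarity matrix, and the representatives `[0, 2, 4]` (0-based) are illustrative assumptions; enumeration is used here only for clarity, and a realistic instance would be handed to an LP or transportation solver instead.

```python
from itertools import product


def solve_assignment_bruteforce(s, reps, u):
    """Maximize sum_i s[i][r_k] over assignments where each cluster holds at most u data."""
    n, p = len(s), len(reps)
    best_value, best_assign = float("-inf"), None
    # assign[i] = k encodes x_ik = 1, so each datum gets exactly one cluster.
    for assign in product(range(p), repeat=n):
        if any(assign.count(k) > u for k in range(p)):
            continue  # violates the capacity constraint
        value = sum(s[i][reps[assign[i]]] for i in range(n))
        if value > best_value:
            best_value, best_assign = value, assign
    return best_value, best_assign


# Hypothetical symmetric similarity matrix for n = 5 data.
s = [
    [1.0, 0.8, 0.1, 0.2, 0.0],
    [0.8, 1.0, 0.2, 0.1, 0.1],
    [0.1, 0.2, 1.0, 0.7, 0.3],
    [0.2, 0.1, 0.7, 1.0, 0.4],
    [0.0, 0.1, 0.3, 0.4, 1.0],
]

value, assign = solve_assignment_bruteforce(s, reps=[0, 2, 4], u=2)
print(assign)  # (0, 0, 1, 1, 2)
```

Each representative lands in its own cluster without being forced to, because $s_{ii}=1$ is the largest entry in its row. If $p \cdot u < n$ no assignment is feasible and the function returns `(-inf, None)`.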

Assignment Clustering Algorithm

Step 0: Set $I=\{1,2,...,n\}$ and $K=\{1,2,...,p\}$.
Step 1: Load the number of clusters $p$ and the upper bound $u$ on the number of data per cluster.
Step 2: Load or compute the similarity coefficients between every pair of data.
Step 3: Determine the cluster representatives using Method 1 (equations (15) and (16) in the paper) or Method 2 (equation (17)), then remove $r_k\;(k=1,2,...,p)$ from $I$.
Step 4: Determine $(v,w)=\arg\max_{i\in I,\,k\in K}s_{ir_k}$.
Step 5: If cluster $w$ already contains $u$ data, remove $w$ from $K$ and go to Step 4; otherwise, assign datum $v$ to cluster $w$ and delete $v$ from $I$.
Step 6: If $I\ne\emptyset$, go to Step 4.
Step 7: Evaluate the clustering result using one or more performance criteria.
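Steps 3 to 6 can be sketched in Python as below. Several things are assumptions rather than statements from the paper: the function name, the 0-based indices, the example matrix and representatives, the deterministic tie-breaking by sorted order, and in particular the choice to count each representative as a member of its own cluster when checking the bound $u$. Step 7 (evaluation) is left out.

```python
def assignment_clustering(s, reps, u):
    """Greedy assignment: repeatedly place the (datum, cluster) pair with highest similarity."""
    n, p = len(s), len(reps)
    # Assumption: each representative starts in its own cluster and counts toward u.
    clusters = {k: [reps[k]] for k in range(p)}
    I = set(range(n)) - set(reps)   # Step 3: unassigned data
    K = set(range(p))               # clusters that may still accept data
    while I:                        # Step 6
        if not K:
            raise ValueError("all clusters are full: p * u is smaller than n")
        # Step 4: most similar (datum, representative) pair among those remaining.
        v, w = max(((i, k) for i in sorted(I) for k in sorted(K)),
                   key=lambda ik: s[ik[0]][reps[ik[1]]])
        if len(clusters[w]) >= u:   # Step 5: cluster w is full
            K.remove(w)
            continue
        clusters[w].append(v)
        I.remove(v)
    return clusters


# Hypothetical symmetric similarity matrix for n = 5 data.
s = [
    [1.0, 0.8, 0.1, 0.2, 0.0],
    [0.8, 1.0, 0.2, 0.1, 0.1],
    [0.1, 0.2, 1.0, 0.7, 0.3],
    [0.2, 0.1, 0.7, 1.0, 0.4],
    [0.0, 0.1, 0.3, 0.4, 1.0],
]

print(assignment_clustering(s, reps=[0, 2, 4], u=2))
# {0: [0, 1], 1: [2, 3], 2: [4]}
```

Datum 1 joins the cluster of representative 0 (similarity 0.8) and datum 3 joins the cluster of representative 2 (similarity 0.7); datum 4 stays alone because nothing else is closer to it than to the other representatives.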

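This paper can be understood well through a social example. Suppose our society contains doctors, lawyers, capitalists, workers, farmers, landlords, and other occupations, and we want to divide them into p=3 classes. We first have to define their (dis)similarity. A doctor assesses a patient's body, locates the source of the illness, and writes a prescription. A lawyer likewise assesses the case of a "patient" (the client) suffering from an "illness" (a lawsuit), finds the points of conflict and the opening in the case, and finally handles the "illness" (the case) with a "prescription" (a lawyer's letter or a complaint). We therefore treat this trait as one element of a vector: 1 if the occupation has the trait, 0 if it does not. Similarly, teachers and driving-school…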