python 编写聚类指标purity纯度和jaccard函数

最新推荐文章于 2024-06-13 06:09:31 发布

BloodyBlondie

最新推荐文章于 2024-06-13 06:09:31 发布

阅读量2.1k

点赞数 5

分类专栏： python 文章标签： python

本文链接：https://blog.csdn.net/weixin_45529837/article/details/106313295

版权

python 专栏收录该内容

13 篇文章 2 订阅

订阅专栏

自编purity纯度和jaccard函数，最后运算速度都挺快的，另外，似乎用scipy中的混淆矩阵也可以编写scipy，而且要比我写的jaccard简便一些，可能是把我写的一些封装了吧。

from sklearn import datasets
from sklearn.utils.linear_assignment_ import linear_assignment
import seaborn as sns
import matplotlib.pyplot as plt
import copy
from sklearn.metrics import confusion_matrix
from sklearn import metrics
from sklearn.cluster import KMeans
import pandas as pd
import numpy as np

纯度

def purity(cluster, label):
    cluster = np.array(cluster)
    label = np. array(label)
    indedata1 = {}
    for p in np.unique(label):
        indedata1[p] = np.argwhere(label == p)
    indedata2 = {}
    for q in np.unique(cluster):
        indedata2[q] = np.argwhere(cluster == q)

    count_all = []
    for i in indedata1.values():
        count = []
        for j in indedata2.values():
            a = np.intersect1d(i, j).shape[0]
            count.append(a)
        count_all.append(count)

    return sum(np.max(count_all, axis=0))/len(cluster)

jaccard

def jaccard(cluster, label):
    dist_cluster = np.abs(np.tile(cluster, (len(cluster), 1)) -
                          np.tile(cluster, (len(cluster), 1)).T)
    dist_label = np.abs(np.tile(label, (len(label), 1)) -
                        np.tile(label, (len(label), 1)).T)
    a_loc = np.argwhere(dist_cluster+dist_label == 0)
    n = len(cluster)
    a = (a_loc.shape[0]-n)/2
    same_cluster_index = np.argwhere(dist_cluster == 0)
    same_label_index = np.argwhere(dist_label == 0)
    bc = same_cluster_index.shape[0]+same_label_index.shape[0]-2*n-2*a
    return a/(a+bc)

BloodyBlondie

关注

5
点赞
踩
10

收藏

觉得还不错? 一键收藏
5
评论
python 编写聚类指标purity纯度和jaccard函数

自编purity纯度和jaccard函数，最后运算速度都挺快的，另外，似乎用scipy中的混淆矩阵也可以编写scipy，而且要比我写的jaccard简便一些，可能是把我写的一些封装了吧。from sklearn import datasetsfrom sklearn.utils.linear_assignment_ import linear_assignmentimport seaborn as snsimport matplotlib.pyplot as pltimport copyfrom
复制链接

扫一扫