meanshift

最新推荐文章于 2022-12-07 00:55:01 发布

竹子熊猫

最新推荐文章于 2022-12-07 00:55:01 发布

阅读量470

点赞数

分类专栏： python 文章标签： python

本文链接：https://blog.csdn.net/summermaoz/article/details/122476004

版权

python 专栏收录该内容

63 篇文章 1 订阅

订阅专栏


import numpy as np
from sklearn.cluster import MeanShift, estimate_bandwidth


def meanShift(features):
    '''
    '''
 
    features = np.array(features).astype(np.float64)

    '''本质上就是求平均最远k近邻距离, quantile的值表示进行近邻搜索时候的近邻占样本的比例'''
    bandwidth = estimate_bandwidth(features, quantile=0.2)
    ms = MeanShift(bandwidth=bandwidth, bin_seeding=True)
    ms.fit(features)
    labels = ms.labels_
    unique_labels = set(labels)

    # core_samples_mask = np.zeros_like(labels, dtype = bool)
    # core_samples_mask[ms.core_sample_indices_] = True

    clusters = []
    for k in unique_labels:
        ##-1表示噪声点,这里的k表示黑色
        ##生成一个True、False数组，lables == k 的设置成True
        class_member_mask = (labels == k)
        index = np.where(class_member_mask==True)[0]        
        ##两个数组做&运算，找出即是核心点又等于分类k的值  markeredgecolor='k',
        # index = np.where(class_member_mask & core_samples_mask==True)[0]

        clusters.append(index)

    ##### sort ############
    clusters = sorted(clusters, key = lambda x:len(x), reverse=True)
    
    return clusters

竹子熊猫

关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
meanshift

import numpy as npfrom sklearn.cluster import MeanShift, estimate_bandwidthdef meanShift(features): ''' ''' features = np.array(features).astype(np.float64) '''本质上就是求平均最远k近邻距离, quantile的值表示进行近邻搜索时候的近邻占样本的比例''' bandwidth = estim..
复制链接

扫一扫