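Machine Learning: k-means Principles Explained, with Code

1. For the theory, I referred to this blog post: https://blog.csdn.net/lyq_12/article/details/81043690

2. k-means code: MATLAB

Data clustering

See:

K-Means clustering algorithm, MATLAB code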

function y=kMeansCluster(m,k,isRand)
%%%%%%%%%%%%%%%%
%                                                       
% kMeansCluster - Simple k means clustering algorithm                                                             
% Author: Kardi Teknomo, Ph.D.                                                                 
%                                                                                                                    
% Purpose: classify the objects in data matrix based on the attributes   
% Criteria: minimize Euclidean distance between centroids and object points                   
% For more explanation of the algorithm, see http://people.revoledu.com/kardi/tutorial/kMean/index.html   
%
% Output: matrix data plus an additional column representing the group of each object
%                                                                                                                
% Example: m = [ 1 1; 2 1; 4 3; 5 4]  or in a nice form                        
%          m = [ 1 1;                                                                                    
%                2 1;                                                                                        
%                4 3;                                                                                        
%                5 4]                                                                                         
%          k = 2                                                                                            
% kMeansCluster(m,k) produces y = [ 1 1 1;
%                                   2 1 1;                                                                  
%                                   4 3 2;                                                                  
%                                   5 4 2]                                                                  
% Input:
%   m      - required, matrix data: objects in rows and attributes in columns                                                
%   k      - optional, number of groups (default = 1)
%   isRand - optional, nonzero for random initialization; otherwise (default = 0)
%            the first k data rows are used as the initial centroids
%
% Local Variables
%   f      - row number of data that belong to group i
%   c      - centroid coordinate size (1:k, 1:maxCol)
%   g      - current iteration group matrix size (1:maxRow)
%   i      - scalar iterator
%   maxCol - scalar number of columns in the data matrix m = number of attributes
%   maxRow - scalar number of rows in the data matrix m = number of objects
%   temp   - previous iteration group matrix size (1:maxRow)
%   z      - minimum value (not needed)
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
 

if nargin<3,        isRand=0;   end
if nargin<2,        k=1;        end
   
[maxRow, maxCol]=size(m);
if maxRow<=k,
    y=[m, (1:maxRow)'];   % no more objects than groups: each object is its own group
else
 
    % initial value of centroid
    if isRand,
        p = randperm(size(m,1));      % random initialization
        for i=1:k
            c(i,:)=m(p(i),:);
        end
    else
        for i=1:k
            c(i,:)=m(i,:);            % sequential initialization
        end
    end
   
 temp=zeros(maxRow,1);   % initialize as zero vector
   
 while 1,
        d=DistMatrix(m,c);  % calculate objects-centroid distances
        [z,g]=min(d,[],2);  % find group matrix g
        if isequal(g,temp),
            break;          % stop the iteration
        else
            temp=g;         % copy group matrix to temporary variable
        end
        for i=1:k
            f=find(g==i);
            if ~isempty(f)  % only compute centroid if f is not empty
                c(i,:)=mean(m(f,:),1);
            end
        end
 end
   
 y=[m,g];
   
end
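
To check the example in the function header by hand, the loop above can be sketched as a small standalone script. This is only an illustration under stated assumptions: it inlines the distance computation instead of calling DistMatrix, uses sequential initialization, and the names gNew and delta are illustrative and do not come from the original code.

```matlab
% Standalone sketch of the k-means loop on the header example (not the author's file).
m = [1 1; 2 1; 4 3; 5 4];        % objects in rows, attributes in columns
k = 2;                           % number of groups
c = m(1:k, :);                   % sequential initialization: first k rows
g = zeros(size(m,1), 1);         % previous assignment, starts as all zeros

while true
    % Euclidean distance from every object to every centroid
    d = zeros(size(m,1), k);
    for i = 1:k
        delta = bsxfun(@minus, m, c(i,:));   % row-wise difference to centroid i
        d(:,i) = sqrt(sum(delta.^2, 2));
    end
    [~, gNew] = min(d, [], 2);   % index of the nearest centroid per object
    if isequal(gNew, g)
        break;                   % assignments unchanged: converged
    end
    g = gNew;
    for i = 1:k
        if any(g == i)           % skip empty groups, keep their old centroid
            c(i,:) = mean(m(g == i, :), 1);
        end
    end
end

disp([m, g]);                    % data with the group index appended
```

On this data the assignments settle at groups 1, 1, 2, 2 with centroids (1.5, 1) and (4.5, 3.5), which agrees with the result listed in the header comment.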

The MATLAB function kMeansCluster above calls the function DistMatrix, shown in the code below. It computes Euclidean distances for data of any number of dimensions. (The original page also linked to a description of other distance types.)

function d=DistMatrix(A,B)
%%%%%%%%%%%%%%%%%%%%%%%%%
% DISTMATRIX returns the distance matrix between points in A=[x1 y1 ... w1] and in B=[x2 y2 ... w2]
% Copyright (c) 2005 by Kardi Teknomo,  http://people.revoledu.com/kardi/
%
% The numbers of rows (representing points) in A and B are not necessarily the same.
% It can be used for distance-in-a-slice (Spacing) or distance-between-slice (Headway).
%
% A and B must contain the same number of columns (representing variables of n dimensions);
% the first column is the X coordinates, the second column is the Y coordinates, and so on.
% The distance matrix holds the distances between points in A as rows
% and points in B as columns.
% example: Spacing = DistMatrix(A,A)
%          Headway = DistMatrix(A,B), with hA ~= hB or hA=hB
%          A=[1 2 3; 4 5 6; 2 4 6; 1 2 3]; B=[4 5 1; 6 2 0]
%          DistMatrix(A,B)= [ 4.69   5.83;
%                             5.00   7.00;
%                             5.48   7.48;
%                             4.69   5.83]
%
%          DistMatrix(B,A)= [ 4.69   5.00     5.48    4.69;
%                             5.83   7.00     7.48    5.83]
%%%%%%%%%%%%%%%%%%%%%%%%%%%

[hA,wA]=size(A);
[hB,wB]=size(B);
if wA ~= wB,  error(' second dimension of A and B must be the same'); end
for k=1:wA
    C{k}= repmat(A(:,k),1,hB);   % column k of A, copied across hB columns
    D{k}= repmat(B(:,k),1,hA);   % column k of B, copied across hA columns
end
S=zeros(hA,hB);
for k=1:wA
    S=S+(C{k}-D{k}').^2;         % accumulate squared differences per dimension
end
d=sqrt(S);
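
The same distance matrix can also be computed without the per-column cell arrays. The following is a minimal sketch, not part of the original code, assuming the identity ||a-b||^2 = ||a||^2 + ||b||^2 - 2*a*b'. The names sqA and sqB are illustrative, and the max(S,0) guard is an added assumption to absorb small negative values caused by rounding.

```matlab
% Vectorized Euclidean distance matrix (sketch; uses the example from the DistMatrix header).
A = [1 2 3; 4 5 6; 2 4 6; 1 2 3];   % 4 points in 3 dimensions
B = [4 5 1; 6 2 0];                 % 2 points in 3 dimensions

sqA = sum(A.^2, 2);                 % squared norm of each row of A (hA-by-1)
sqB = sum(B.^2, 2)';                % squared norm of each row of B (1-by-hB)

% Squared distances: ||a||^2 + ||b||^2 - 2*a*b'
S = bsxfun(@plus, sqA, sqB) - 2*(A*B');
d = sqrt(max(S, 0));                % clamp rounding noise below zero before the root

disp(d);                            % rows follow A, columns follow B
```

This form trades the explicit loops for one matrix product, which tends to be faster on large inputs, but it can lose a little precision when two points are nearly identical.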

 

3. k-means code for image clustering and segmentation: MATLAB

See: https://download.csdn.net/download/hyy1568786/11192315

All of the code has been run in simulation without problems.
