机器学习--K-Means

aWty_

于 2024-09-12 12:14:46 发布

阅读量1.2k

点赞数 8

分类专栏： ------Machine Learning------ 文章标签：机器学习 kmeans 人工智能

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.csdn.net/ID246783/article/details/142172068

版权

------Machine Learning------ 专栏收录该内容

11 篇文章 0 订阅

订阅专栏

K均值聚类

算法过程

$K - m e an s$ 是聚类 $c l u s t er in g$ 算法的一种，就是给你一坨东西，让你给他们分类：

在这里插入图片描述

我们的 $K - m e an s$ 大概是这样一个流程：

第一步随机生成两个点（因为这里我想分两类，你想分几类你就弄几个点），标记为两个聚类中心 $\; centriod$ ，像这样：

在这里插入图片描述

然后重复以下两个步骤：

1. 遍历每个点 $x^{(i)}$ ，分别计算点 $x^{(i)}$ 到两个聚类中心的距离 $d_1$ 和 $d_2$ ，然后比较大小。并标记这个点为距离更小的那一类

2. 分别遍历同一类的所有点，计算这些点的几何平均位置，并把聚类中心移动到这个位置

这样说起来可能很抽象，我们还是用图像来更清晰的表示一下这个过程：

在这里插入图片描述

图画到这里我们就能明显的观察到两个聚类已经被划分好了。

优化目标函数

像前面介绍的线性回归、逻辑回归、 $S V M$ 一样，这里的 $K - m e an s$ 也有一个用于优化的函数：

$n o t a t i o n$ ： $c_i$ 表示点 $x_i$ 的类别， $\mu_k$ 表示聚类中心 $k$ ， $\mu_{c_i}$ 表示 $x_i$ 所属的那个聚类中心

$J(c_1, \cdots, c_m, \mu_1, \cdots, \mu_K) = \frac{1}{m}\sum_{i = 1}^m |x_i - \mu_{c_i}|^2$

我们要做的就是：

$\min\limits_{c, \mu} J(c_1, \cdots, c_m, \mu_1, \cdots, \mu_K)$

看得出来，这就是要最小化所有点 $x_i$ 与其所属的聚类中心 $\mu_{x_i}$ 的距离的平方和。

跑 $114514$ 次 $k - m e an s$

可能你也注意到了，我们如果只跑一遍 $k - m e an s$ 的话可能不会得到一个很好的分类方案，所以我们考虑每次随机初始化聚类中心，然后跑很多遍（取决于你的数据规模和时间） $k - m e an s$ ，对于每次计算出来的 $\mu$ 算出它的 $\mu)$ ，然后在其中选择 $\mu)$ 最小的那个分类方案作为最后的答案。

关注

8
点赞
踩
19

收藏

觉得还不错? 一键收藏
0
评论
复制链接

分享到 QQ

分享到新浪微博

扫一扫

专栏目录

aWty_ CSDN认证博客专家 CSDN认证企业博客

码龄3年

162: 原创

12万+: 周排名

2万+: 总排名

9万+: 访问

: 等级

1867: 积分

200: 粉丝

223: 获赞

34: 评论

334: 收藏

私信

关注

热门文章

分类专栏

最新评论

机器学习--神经网络
2401_84079994: I guess the math part is used in certain functions in the whole process flow. I did not try to memorize all the related functions. At least not doing so before I can catch the supporting logic and/or math prove behind. In fact I am trying to run the simple network on a very simple demo example provided in a book, line by line, for about 2 months on and off. Kind of frustrated by the line by line explanations on the book. Only understand what it is doing but lack why it is doing so. Well, it is college experience. Professor can not tell everything and student can not learn everything.
机器学习--神经网络
aWty_: In a particular training, those parameters are randomly given in the initialization part, and the SGD algorithm would help you adjust those parameters automatically to reach a lower loss, which means reaching higher accuracy for prediction. So the initialization for parameters is not a part of network design, but a part of training process. That is my understanding for your question.
机器学习--神经网络
aWty_: In a particular training, those parameters are randomly given in the initialization part, and the SGD algorithm would help you adjust those parameters automatically to reach a lower loss, which means reaching higher accuracy for prediction. So the initialization for parameters is a part of network design, but a part of training process. That is my understanding for your question.
机器学习--神经网络
2401_84079994: thanks a lot for reply. any input fills my curious mind. for deep network, my thought is that it may run much less times in loop while simple network may run much more time in loop, given a timeframe. but, since the designs are different with unknown effect on back and forth process between any two layers, the deeper network may still get better result than simpler network. by the way let me ask a dummy question about the math part. is it true that, given a designed network, you do have a fixed set of parameters (or variables, like x1,x2, etc) and therefore those partial derivatives comes out in play? if true, those pre-fixed parameters are part of the neural network design for a problem? if it is still true, I would guess that the deep learning process may have certain capacity to increase/decrease parameters and change parameter values at the same time in order to generate better result. to this point, it suddenly tastes like partial random evolution process. LOL. thanks again. If I have more energy I may get into the math part a little bit and ask for more understanding.
机器学习--神经网络
aWty_: Firstly, sorry for being unable to answer your questions, I'm also a green hand learner in the machine learning field. However, I would still like to share my viewpoint on your questions. 1. The network cannot guarantee 100% accuracy for predictions, yet you can increase it by adjusting parameters while training your network with SGD. 2. If you are curious about how to design a network, I recommend you learn something about LeNet AlexNet VGG GoogLeNet ResNet NiN and so on. These are both the nets that are proven to be efficient in solving problems like computer vision. 3. As for the third question, the answer is absolutely no. As we all know, the more middle layers a net has, the more time it requires to train. So in a fixed timeframe, an over-complex net may even unable to finish training, let alone provide a better prediction. Additionally, if a network has too many layers, it may encounter a problem called overfitting. So the answer is NO.

最新文章

目录

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。