English Essays (Part 1)

GUI Research Group

Published 2021-10-12 21:28:05

An essay to practice my English writing.
Translated from: 《数据聚类》 (Data Clustering) by 张宪超 (Zhang Xianchao)

Unsupervised Learning

The core of artificial intelligence is machine learning (ML), whose main task is to identify and distinguish between things. ML is divided into two categories: supervised learning and unsupervised learning. The main task of supervised learning is classification, i.e., using a large amount of labeled data to learn how to classify new data. The main task of unsupervised learning is clustering, i.e., partitioning data into classes without manual intervention.

It must be clearly recognized that unsupervised learning is more difficult than supervised learning, and that far fewer researchers work on it. As a result, progress in unsupervised learning has been relatively slow. Nevertheless, scholars have explored the field for decades and produced many results, such as the k-means algorithm. In recent years especially, as the importance of unsupervised learning has been recognized, more scholars have devoted themselves to this field and achieved breakthroughs.
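
Since k-means is named above, a minimal sketch of the standard two-step iteration (assign each point to its nearest centroid, then recompute each centroid as the mean of its points) may help. The function names, the toy data, and the choice of evenly spaced initial centroids are my own assumptions for illustration; they are not taken from the book.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two points of equal dimension."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, max_iterations=100):
    """Plain k-means sketch. Assumption: initial centroids are k points
    taken at evenly spaced positions in the input list, which keeps the
    example deterministic; real implementations usually randomize."""
    n = len(points)
    centroids = [tuple(points[i * n // k]) for i in range(k)]
    labels = None
    for _ in range(max_iterations):
        # Assignment step: index of the nearest centroid for every point.
        new_labels = [
            min(range(k), key=lambda j: squared_distance(p, centroids[j]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing, so the centroids are stable
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # leave an empty cluster's centroid where it is
                centroids[j] = tuple(sum(c) / len(members) for c in zip(*members))
    return labels, centroids


# Hypothetical toy data: two well-separated groups, no labels given.
data = [(1.0, 1.0), (1.2, 0.8), (0.8, 1.1), (5.0, 5.0), (5.2, 4.9), (4.8, 5.1)]
labels, centroids = kmeans(data, k=2)
print(labels)
print(centroids)
```

On this toy data the first three points end up in one cluster and the last three in the other, without any labels being supplied.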

Clustering is one of the most important problems in unsupervised learning. It is used in many real-world applications, such as image segmentation, bioinformatics, and financial fraud detection. Clustering groups unlabeled data, thereby discovering the natural structure of the data. It is typically applied for the following three purposes.

finding the latent structure of data
grouping data naturally
compressing data
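
The third use, compression, can be sketched as follows: once cluster centers are known, each value is stored as a small integer index into the list of centers instead of the value itself. The centers below are assumed to come from some clustering run and are hard-coded here; all names and numbers are illustrative only.

```python
def quantize(values, codebook):
    """Replace each value with the index of its nearest codebook entry."""
    return [
        min(range(len(codebook)), key=lambda j: abs(v - codebook[j]))
        for v in values
    ]


def reconstruct(codes, codebook):
    """Approximate the original values from the stored indices."""
    return [codebook[c] for c in codes]


# Hypothetical measurements and assumed cluster centers.
values = [0.9, 1.1, 1.0, 4.8, 5.2, 5.0, 9.1, 8.9]
codebook = [1.0, 5.0, 9.0]

codes = quantize(values, codebook)
approx = reconstruct(codes, codebook)
max_error = max(abs(v - a) for v, a in zip(values, approx))
print(codes)
print(max_error)
```

The compression is lossy: only the index of the center is kept, so the reconstruction error is bounded by how far a value lies from its center.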

Thousands of clustering algorithms have been published. These algorithms can be divided into partition-based algorithms, hierarchical algorithms, density-based algorithms, etc.
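
To give a feel for one of these families, here is a minimal sketch of a hierarchical method: bottom-up (agglomerative) clustering with single linkage, where every point starts as its own cluster and the two closest clusters are merged repeatedly. It is a naive version written for readability; the function name, the stopping rule (a target number of clusters), and the toy data are my assumptions.

```python
def single_linkage(points, num_clusters):
    """Agglomerative clustering sketch with single linkage.
    Returns clusters as sorted lists of point indices."""

    def squared_distance(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))

    def linkage(c1, c2):
        # Single linkage: distance between the closest pair of members.
        return min(squared_distance(points[i], points[j]) for i in c1 for j in c2)

    # Start with one cluster per point.
    clusters = [[i] for i in range(len(points))]
    while len(clusters) > num_clusters:
        # Find the pair of clusters with the smallest linkage distance.
        a, b = min(
            ((a, b) for a in range(len(clusters)) for b in range(a + 1, len(clusters))),
            key=lambda ab: linkage(clusters[ab[0]], clusters[ab[1]]),
        )
        # Merge cluster b into cluster a (a < b, so deleting b is safe).
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return [sorted(c) for c in clusters]


# Hypothetical toy data: two well-separated groups.
data = [(1.0, 1.0), (1.2, 0.8), (0.8, 1.1), (5.0, 5.0), (5.2, 4.9), (4.8, 5.1)]
result = single_linkage(data, num_clusters=2)
print(result)
```

Unlike a partition-based method, this procedure produces a whole sequence of merges; stopping at a chosen number of clusters is just one way to read a grouping out of that hierarchy.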

Research on clustering can be divided into three areas.