**I. What Is Machine Learning**
1. Arthur Samuel's definition:
“The field of study that gives computers the ability to learn without being explicitly programmed.”
2. Tom Mitchell's definition:
“A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P, if its performance at tasks in T, as measured by P, improves with experience E.”
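To make E, T, and P concrete, here is a minimal sketch. Everything in it is a made-up illustration rather than part of either definition: the task T is predicting the output of a hypothetical relationship `target`, the experience E is a stream of labelled examples, and the performance measure P is the mean absolute error on a fixed set of test inputs. The learner is a simple 1-nearest-neighbour lookup, chosen only because it is short.

```python
def target(x):
    """Hypothetical relationship the program should learn (task T)."""
    return 2.0 * x + 1.0


def predict(memory, x):
    """1-nearest-neighbour: answer with the output of the closest stored example."""
    _, nearest_y = min(memory, key=lambda pair: abs(pair[0] - x))
    return nearest_y


def mean_abs_error(memory, test_inputs):
    """Performance measure P: average absolute error on a fixed test set."""
    errors = [abs(predict(memory, x) - target(x)) for x in test_inputs]
    return sum(errors) / len(errors)


# Experience E: labelled examples that arrive one at a time (toy values).
experience = [(x, target(x)) for x in [0.0, 10.0, 5.0, 2.5, 7.5, 1.0, 9.0, 4.0, 6.0]]
test_inputs = [0.5 * k for k in range(21)]  # 0.0, 0.5, ..., 10.0

memory = []
history = []  # P measured after each new example
for example in experience:
    memory.append(example)
    history.append(mean_abs_error(memory, test_inputs))

for seen, error in enumerate(history, start=1):
    print(f"examples seen: {seen}  mean abs error: {error:.3f}")
```

In this toy setup the error never increases as more examples are stored, which is what "performance at T, as measured by P, improves with experience E" means in practice.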
**II. Supervised Learning**
In supervised learning, we are given a data set and already know what the correct output should look like, on the assumption that there is a relationship between the input and the output.
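Supervised learning problems fall into two categories: "regression" and "classification".
1. In a regression problem, we try to predict a continuous output. In other words, we try to map the input variables to some continuous function.
2. In a classification problem, we try to predict a discrete output. In other words, we try to map the input variables into discrete categories.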
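The difference between the two categories can be sketched in a few lines of plain Python. The data below is invented purely for illustration (house sizes and prices for regression, tumor sizes and 0/1 labels for classification), and the helper names `fit_line`, `fit_threshold`, `predict_value`, and `classify` are hypothetical. The regression part uses ordinary least squares; the classification part uses a very simple rule (a cut halfway between the two class means) that assumes class 1 has the larger values.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def predict_value(line, x):
    """Regression output: any real number (continuous)."""
    slope, intercept = line
    return slope * x + intercept


def fit_threshold(xs, labels):
    """Place a cut halfway between the mean of class 0 and the mean of class 1."""
    zeros = [x for x, label in zip(xs, labels) if label == 0]
    ones = [x for x, label in zip(xs, labels) if label == 1]
    return (sum(zeros) / len(zeros) + sum(ones) / len(ones)) / 2


def classify(threshold, x):
    """Classification output: one of a fixed set of labels (discrete)."""
    return 1 if x >= threshold else 0


# Regression: toy house sizes -> prices (made-up numbers).
sizes = [50.0, 80.0, 100.0, 120.0, 150.0]
prices = [170.0, 260.0, 320.0, 380.0, 470.0]
line = fit_line(sizes, prices)
predicted_price = predict_value(line, 90.0)

# Classification: toy tumor sizes -> 0 (benign) or 1 (malignant), made-up numbers.
tumor_sizes = [1.0, 1.5, 2.0, 4.0, 4.5, 5.0]
is_malignant = [0, 0, 0, 1, 1, 1]
threshold = fit_threshold(tumor_sizes, is_malignant)
predicted_classes = [classify(threshold, x) for x in (2.5, 3.5)]

print("predicted price:", predicted_price)
print("predicted classes:", predicted_classes)
```

Both models are trained on inputs paired with known correct outputs, which is what makes them supervised; only the type of the output differs.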
**III. Unsupervised Learning**
Unsupervised learning allows us to approach problems with little or no idea of what our results should look like. We can derive structure from data where we do not necessarily know the effect of the variables. With unsupervised learning there is no feedback based on the prediction results (no performance measure); in other words, there is no teacher to correct you.
We can derive this structure by clustering the data based on the relationships among its variables.
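As one possible illustration of clustering, the sketch below runs a bare-bones k-means on a handful of made-up 2-D points. The notes do not name a specific algorithm, so k-means is an assumption here, as are the function names, the toy data, and the choice of starting centroids. Note that no labels are given to the algorithm; the grouping comes only from the distances between points.

```python
def squared_distance(p, q):
    """Squared Euclidean distance between two points of equal dimension."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def k_means(points, initial_centroids, iterations=10):
    """Alternate between assigning points to the nearest centroid and
    moving each centroid to the mean of its assigned points."""
    centroids = list(initial_centroids)
    assignments = []
    for _ in range(iterations):
        # Assignment step: index of the closest centroid for every point.
        assignments = [
            min(range(len(centroids)), key=lambda j: squared_distance(p, centroids[j]))
            for p in points
        ]
        # Update step: recompute each centroid as the mean of its members.
        for j in range(len(centroids)):
            members = [p for p, a in zip(points, assignments) if a == j]
            if members:  # an empty cluster keeps its previous centroid
                centroids[j] = tuple(sum(c) / len(members) for c in zip(*members))
    return assignments, centroids


# Toy, unlabeled data: two visually separate groups (made-up numbers).
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 1.5), (8.0, 8.0), (9.0, 8.5), (8.5, 9.0)]

# Starting centroids picked by hand for this sketch; real use often picks them at random.
assignments, centroids = k_means(points, [points[0], points[3]])

print("cluster of each point:", assignments)
print("centroids:", centroids)
```

The returned cluster indices are the "structure" extracted from the data; nothing in the procedure checks them against a correct answer.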