机器学习应用到越来越多的领域,如data mining, natural language processing, image recognition, and expert systems
什么叫机器学习?
“A computer program is said to learn from experience E with respect to some task T and some performance measure P, if its performance on T, as measured by P, improves with experience E.” -- Tom Mitchell, Carnegie Mellon University 1997年
机器学习分为两种:
1. 监督学习:
- The program is “trained” on a pre-defined set of “training examples”, which then facilitate its ability to reach an accurate conclusion when given new data.
2. 非监督学习
- The program is given a bunch of data and must find patterns and relationships therein.
监督学习:设计一个预测函数h(x), 简化one input vaule模型 ,where and are constants. Our goal is to find the perfect values of and to make our predictor work as well as possible.
使用什么方式来得到和呢? Optimizing the predictor h(x)
is done using training examples.
比如input数据x_train, 相应的Output 数据y。我们预测的值是h(x_train), 那么在h(x_train)和他的真实值y之间就会产生diff, 通过大量的train, 我们可以调整和的值,使得diff最小,也就是优化过程。这样我们就能通过 来给出真实世界的预测值。