Machine Learning Notes - Introduction

Introduction


机器学习英文做笔记,顺便学英语。


What is Machine Learning?

There isn’t a well accepted definition of what is and what isn’t machine learning.

A computer program is said to learn from experience E with respect to some task T and some performance measure P, if its performance on T, as measured by P, improves with experience E.

— Tom


Example

Let’s say your email program watches which emails you do or do not mark as spam. So in an email client like this, you might click the Spam button to report some email as spam but not other emails. And based on which emails you mark as spam, say your email program learns better how to filter spam email.

  • classifying emails is the task T.
  • watching you label emails as spam or not spam is the experience E.
  • the fraction of emails correctly classified, that might be a performance measure P.

There are several different types of learning algorithms.

The main two types are what we call supervised learning and unsupervised learning.

I hope to make you one of the best people in knowing how to design and build serious machine learning and AI systems.


Supervised Learning

In supervised learning, we are given a data set and already know what our correct output should look like, having the idea that there is a relationship between the input and the output.

Supervised learning problems are categorized into “regression”(回归) and “classification”(分类) problems. In a regression problem, we are trying to predict results within a continuous output, meaning that we are trying to map input variables to some continuous function. In a classification problem, we are instead trying to predict results in a discrete output. In other words, we are trying to map input variables into discrete categories.

An example of a regression problem.

put a straight line through the data, also fit a straight line to the data. (purple)

And there might be a better one. For example, instead of fitting a straight line to the data, we might decide that it’s better to fit a quadratic function, or a second-order polynomial to this data. (blue)

在这里插入图片描述

An example of a classification problem.

在这里插入图片描述

We’re trying to predict a discrete value output zero or one. sometimes you can have more than two possible values for the output.

Another classification problem.

在这里插入图片描述
The learning algorithm can deal with an infinite number of features.

Unsupervised Learning

Unsupervised learning allows us to approach problems with little or no idea what our results should look like. We can derive structure from data where we don’t necessarily know the effect of the variables.

We can derive this structure by clustering(聚集) the data based on relationships among the variables in the data.

With unsupervised learning there is no feedback based on the prediction results.

Example

Clustering: Take a collection of 1,000,000 different genes, and find a way to automatically group these genes into groups that are somehow similar or related by different variables, such as lifespan, location, roles, and so on.
在这里插入图片描述

Non-clustering: The “Cocktail Party Algorithm”, allows you to find structure in a chaotic environment. (i.e. identifying individual voices and music from a mesh of sounds at a cocktail party).

在这里插入图片描述

words and phrases

English中文English中文
practitioner实践者;实习者the field of领域
claim宣称remarkable卓越的;非凡的
occasionally偶尔;间或the fraction ofxx的比例
make sense有意义;言之有理properly适当地;正确地;
regression回归classification分类
discrete离散的categorize分类
discrete离散的categorize分类
lifespan寿命discrete离散的
  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值