Machine Learning by Andrew Ng
💡 Study notes for Andrew Ng's Machine Learning course, Week 1
🐠 A compilation of all my study notes
✓ Course link: Stanford Machine Learning
🍭 References
Outline
Introduction
Machine Learning Definition
💡 ML Definition
A computer program is said to learn from experience E with respect to some task T and some performance measure P, if its performance on T, as measured by P, improves with experience E.
Types of Machine Learning Algorithms
- Supervised Learning
- Unsupervised Learning
- other: reinforcement learning (RL), recommender systems
Supervised Learning
Teach the machine to learn from data where the correct answers (labels) are given.
Types of supervised learning:
- Regression: predict a continuous value
- Classification: predict a discrete value
Regression
Classification
Unsupervised Learning
Ask the machine to find the structure of an unlabeled data set; it automatically discovers structure in the data.
The cocktail party problem
An unsupervised learning algorithm can separate the overlapping voices from different sources at a party.
Model and Cost Function
$(x^{(i)}, y^{(i)})$ denotes the $i$-th training example
Model Representation
$h$ stands for hypothesis: the function that maps inputs $x$ to predicted outputs
Univariate linear regression: $h_\theta(x) = \theta_0 + \theta_1 x$
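As a minimal sketch (the function name and example values are mine, not from the course), the univariate hypothesis can be written as:

```python
def h(theta0, theta1, x):
    """Univariate linear regression hypothesis: h_theta(x) = theta0 + theta1 * x."""
    return theta0 + theta1 * x

# Illustrative parameter values
print(h(1.0, 2.0, 3.0))  # 1 + 2*3 = 7.0
```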
Cost Function
💡 Idea
Choose $\theta_0, \theta_1$ s.t. $h_\theta(x)$ is close to $y$
define the cost function as
Squared error cost function
our goal is
$$\min_{\theta_0, \theta_1} \; \frac{1}{2m} \sum_{i=1}^{m} \left( h_\theta(x^{(i)}) - y^{(i)} \right)^2$$
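A minimal Python sketch of the squared error cost (function name and data are illustrative):

```python
def compute_cost(theta0, theta1, xs, ys):
    """Squared error cost: J(theta0, theta1) = (1/2m) * sum((h(x_i) - y_i)^2)."""
    m = len(xs)
    return sum((theta0 + theta1 * x - y) ** 2 for x, y in zip(xs, ys)) / (2 * m)

# Data generated from y = 2x: a perfect fit gives zero cost
xs, ys = [1, 2, 3], [2, 4, 6]
print(compute_cost(0.0, 2.0, xs, ys))  # 0.0
print(compute_cost(0.0, 0.0, xs, ys))  # (4 + 16 + 36) / 6 ≈ 9.333
```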
Our goal is to minimize the cost function and find the global minimum
A contour plot/figure to visualize the cost function
Parameter Learning
Gradient descent 梯度下降法
a general algorithm for minimizing a function
Intuition
- Learning rate
- Simultaneously Update All Parameters
A common pitfall: all parameters must be updated simultaneously, not one after another.
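The simultaneous update can be sketched in Python (the parameter and gradient values here are made up for illustration; the gradients are not computed):

```python
alpha = 0.1
theta0, theta1 = 1.0, 2.0
d_theta0, d_theta1 = 0.5, -0.5   # illustrative gradient values, not computed here

# Correct: evaluate both update terms using the OLD parameter values,
# then assign together. Updating theta0 first and then using the new
# theta0 to update theta1 would be the common mistake.
temp0 = theta0 - alpha * d_theta0
temp1 = theta1 - alpha * d_theta1
theta0, theta1 = temp0, temp1
print(theta0, theta1)  # 0.95 2.05
```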
derivative
the (partial) derivative term
the slope of the tangent line
The importance of the learning rate:
- A small learning rate can lead to slow convergence
- A large learning rate can cause failure to converge, or even divergence
If initialized at a local optimum, the derivative is zero, so the parameters stay unchanged.
A fixed learning rate is sufficient for convergence:
Gradient descent can converge to a local minimum even with a fixed learning rate, because the derivative term becomes smaller as it approaches the local minimum, so the steps shrink automatically.
Gradient Descent for Linear Regression
convex function: a bowl-shaped function
With an appropriate learning rate, gradient descent on a convex function always converges to the global minimum (there are no local minima other than the global one).
following the trajectory, it reaches the global minimum
The above is called Batch Gradient Descent: each step of gradient descent uses all the training examples.
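A minimal Python sketch of batch gradient descent for univariate linear regression (function name, data, learning rate, and iteration count are illustrative, not from the course):

```python
def batch_gradient_descent(xs, ys, alpha=0.01, iters=1000):
    """Batch gradient descent for univariate linear regression.

    Each step uses ALL m training examples to compute the gradient of
    J(theta0, theta1) = (1/2m) * sum((h(x_i) - y_i)^2).
    """
    m = len(xs)
    theta0, theta1 = 0.0, 0.0
    for _ in range(iters):
        errors = [theta0 + theta1 * x - y for x, y in zip(xs, ys)]
        grad0 = sum(errors) / m
        grad1 = sum(e * x for e, x in zip(errors, xs)) / m
        # Simultaneous update of both parameters
        theta0, theta1 = theta0 - alpha * grad0, theta1 - alpha * grad1
    return theta0, theta1

# Data generated from y = 2x + 1; the fit should approach theta0 = 1, theta1 = 2
xs = [0, 1, 2, 3, 4]
ys = [1, 3, 5, 7, 9]
theta0, theta1 = batch_gradient_descent(xs, ys, alpha=0.05, iters=5000)
print(round(theta0, 3), round(theta1, 3))
```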
Linear Algebra Review
Matrix
Matrix elements (the entries of a matrix)
Vector: an n by 1 matrix
1-indexed vs 0-indexed (two notational conventions)
Uppercase letters for matrices: A, B, C
Lowercase letters for vectors: a, b, c
Addition and Scalar Multiplication
Matrix Vector Multiplication
Matrix Matrix Multiplication
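Assuming NumPy (my choice, not mentioned in the notes), matrix-vector and matrix-matrix multiplication look like:

```python
import numpy as np

A = np.array([[1, 2], [3, 4], [5, 6]])   # 3x2 matrix
v = np.array([7, 8])                     # 2-vector

# Matrix-vector: (3x2) @ (2,) -> (3,)
print(A @ v)          # [1*7+2*8, 3*7+4*8, 5*7+6*8] = [23 53 83]

B = np.array([[1, 0], [0, 1]])           # 2x2 identity
# Matrix-matrix: (3x2) @ (2x2) -> (3x2); multiplying by I leaves A unchanged
print(A @ B)
```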
Matrix Multiplication Properties
not commutative: $A B \neq B A$ in general
associative: $(A B) C = A (B C)$
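These two properties can be checked numerically; NumPy and the example matrices are my assumptions:

```python
import numpy as np

A = np.array([[1, 2], [3, 4]])
B = np.array([[0, 1], [1, 0]])
C = np.array([[2, 0], [0, 2]])

# Not commutative: A @ B and B @ A generally differ
print(np.array_equal(A @ B, B @ A))            # False

# Associative: grouping does not matter
print(np.array_equal((A @ B) @ C, A @ (B @ C)))  # True
```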
Inverse And Transpose
The inverse of $A$ is denoted $A^{-1}$.
A non-square matrix does not have an inverse.
A square matrix that has no inverse is called singular or degenerate.
The transpose of $A$ is denoted $A^T$.
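A NumPy sketch of inverse and transpose (NumPy and the example matrices are my assumptions); note that attempting to invert a singular matrix raises an error:

```python
import numpy as np

A = np.array([[4.0, 7.0], [2.0, 6.0]])   # invertible: det(A) = 10

A_inv = np.linalg.inv(A)   # inverse: A @ A_inv is the identity
A_T = A.T                  # transpose: rows become columns

print(np.allclose(A @ A_inv, np.eye(2)))  # True

# A singular (non-invertible) matrix raises LinAlgError
singular = np.array([[1.0, 2.0], [2.0, 4.0]])  # rows are linearly dependent
try:
    np.linalg.inv(singular)
except np.linalg.LinAlgError:
    print("singular matrix has no inverse")
```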