[Machine Learning] Andrew Ng Machine Learning Course Notes - Week 3

Machine Learning by Andrew Ng

💡 Andrew Ng Machine Learning course notes - Week 3
🐠 My collected study notes: compilation volume
✓ Course page: Stanford Machine Learning
🍭 References

Outline

Classification and Representation

1.Classification

Label 0 denotes the negative class (the absence of something).
Label 1 denotes the positive class (the presence of something).
Which label is called negative and which positive is a rather arbitrary choice.



Linear regression is not a good choice here: the values to predict take on a small number of discrete values, while a linear hypothesis can output values well outside that range.


Logistic Regression is a classification algorithm
(not a regression algorithm, despite what its name may suggest).


2.Hypothesis Representation
We want our classifier to output values in the range $0 \le h_\theta(x) \le 1$.

We turn the linear regression hypothesis
$$h_\theta(x) = \theta^T x$$
into
$$h_\theta(x) = g(\theta^T x)$$
where
$$g(z) = \frac{1}{1 + e^{-z}}$$
then we get
$$h_\theta(x) = \frac{1}{1 + e^{-\theta^T x}}$$
g is called the sigmoid function or the logistic function.

Properties of the sigmoid function:

g(z) asymptotes to 0 as z goes to minus infinity and asymptotes to 1 as z goes to plus infinity; g(0) = 0.5.

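A minimal sketch of the sigmoid and the logistic hypothesis in Python/NumPy (my own choice of language and names; the course itself uses Octave):

```python
import numpy as np

def sigmoid(z):
    """Logistic function g(z) = 1 / (1 + e^(-z))."""
    return 1.0 / (1.0 + np.exp(-z))

def hypothesis(theta, x):
    """h_theta(x) = g(theta^T x); always lies in (0, 1)."""
    return sigmoid(theta @ x)

# g asymptotes to 0 and 1 at the extremes, and g(0) = 0.5
print(sigmoid(np.array([-10.0, 0.0, 10.0])))  # ~[4.5e-05, 0.5, 0.99995]
```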


The sigmoid curve is S-shaped, rising monotonically from 0 to 1 and crossing 0.5 at z = 0.


Example: classifying a tumor as malignant vs. benign.

Interpretation of the hypothesis output:
$h_\theta(x)$ is the probability that y = 1, given x, parameterized by $\theta$, i.e. $h_\theta(x) = P(y = 1 \mid x; \theta)$.


3.Decision Boundary


The decision boundary: we predict y = 1 whenever $h_\theta(x) \ge 0.5$, which (since $g(z) \ge 0.5$ exactly when $z \ge 0$) is whenever $\theta^T x \ge 0$; the boundary itself is the set of points where $\theta^T x = 0$.


The decision boundary is a property of the hypothesis and its parameters $\theta$, not of the training set (the training set is only used to fit the parameters).


With higher-order polynomial feature terms we can obtain more complex, non-linear decision boundaries, as sketched below.

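As an illustration (a toy example of my own, not from the notes), prediction reduces to checking the sign of $\theta^T$ applied to the feature vector; with squared features the boundary can be a circle:

```python
import numpy as np

def predict(theta, features):
    """Predict y = 1 when theta^T * features >= 0, i.e. when h_theta >= 0.5."""
    return int(features @ theta >= 0)

# Hypothetical parameters theta = [-1, 0, 0, 1, 1] over the feature map
# [1, x1, x2, x1^2, x2^2] give the circular decision boundary x1^2 + x2^2 = 1.
theta = np.array([-1.0, 0.0, 0.0, 1.0, 1.0])

x1, x2 = 0.5, 0.5  # a point inside the circle
feats = np.array([1.0, x1, x2, x1**2, x2**2])
print(predict(theta, feats))  # 0 (inside the circle -> negative class)
```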

Logistic Regression Model

1.Cost Function
= the optimization objective

Given a training set, how do we choose $\theta$?



Recall the cost function of linear regression:
$$J(\theta) = \frac{1}{2m} \sum_{i=1}^{m} \left( h_\theta(x^{(i)}) - y^{(i)} \right)^2$$


If we directly reuse the squared-error cost of linear regression with the sigmoid hypothesis, J(θ) turns out to be non-convex, so gradient descent is not guaranteed to find the global minimum.


So we have to find a new cost function that makes J convex:

$$J(\theta) = \frac{1}{m} \sum_{i=1}^{m} \mathrm{Cost}\big(h_\theta(x^{(i)}),\, y^{(i)}\big)$$

where, for a single training example,
$$\mathrm{Cost}(h_\theta(x), y) = \begin{cases} -\log(h_\theta(x)) & \text{if } y = 1 \\ -\log(1 - h_\theta(x)) & \text{if } y = 0 \end{cases}$$

Properties of this cost: when y = 1, the cost is 0 if $h_\theta(x) = 1$ and grows without bound as $h_\theta(x) \to 0$ (and symmetrically for y = 0), so confident wrong predictions are penalized very heavily.


2.Simplified Cost Function and Gradient Descent

We can compress the cost function's two conditional cases into a single expression:
$$\mathrm{Cost}(h_\theta(x), y) = -y \log(h_\theta(x)) - (1 - y) \log(1 - h_\theta(x))$$

The full cost function:
$$J(\theta) = -\frac{1}{m} \sum_{i=1}^{m} \left[ y^{(i)} \log\big(h_\theta(x^{(i)})\big) + (1 - y^{(i)}) \log\big(1 - h_\theta(x^{(i)})\big) \right]$$

A vectorized implementation is
$$h = g(X\theta)$$
$$J(\theta) = \frac{1}{m} \cdot \big( -y^T \log(h) - (1 - y)^T \log(1 - h) \big)$$

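A sketch of this vectorized cost in Python/NumPy (my translation of the formula above; the course itself uses Octave):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def cost(theta, X, y):
    """J(theta) = (1/m) * (-y' log(h) - (1 - y)' log(1 - h)), with h = g(X theta)."""
    m = len(y)
    h = sigmoid(X @ theta)
    return float(-y @ np.log(h) - (1 - y) @ np.log(1 - h)) / m
```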


Apply the gradient descent template and take the derivative of J; the resulting update rule is
$$\theta_j := \theta_j - \alpha \frac{1}{m} \sum_{i=1}^{m} \big( h_\theta(x^{(i)}) - y^{(i)} \big) x_j^{(i)}$$
(updated simultaneously for all j).


The update rule looks identical to the one for linear regression, except that the definition of $h_\theta(x)$ has changed: it is now $g(\theta^T x)$ rather than $\theta^T x$.


A vectorized implementation: $\theta := \theta - \frac{\alpha}{m} X^T \big( g(X\theta) - \vec{y} \big)$.

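A minimal batch gradient descent sketch in NumPy (assuming X already contains a leading column of ones for the bias term; α and the iteration count are placeholders):

```python
import numpy as np

sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))

def gradient_descent(theta, X, y, alpha=0.1, iters=1000):
    """Repeat: theta := theta - (alpha/m) * X^T (g(X theta) - y)."""
    m = len(y)
    for _ in range(iters):
        grad = X.T @ (sigmoid(X @ theta) - y) / m
        theta = theta - alpha * grad
    return theta
```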

3.Advanced Optimization
There are other optimization algorithms besides gradient descent, such as Conjugate Gradient, BFGS, and L-BFGS; they typically converge faster and do not require manually choosing a learning rate α, at the cost of being more complex.


Octave part of optimization, skip
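The course demonstrates this with Octave's fminunc; a rough Python analogue (an assumption on my part, using scipy.optimize.minimize with the cost and gradient from above) could look like:

```python
import numpy as np
from scipy.optimize import minimize

sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))

def cost_and_grad(theta, X, y):
    """Return J(theta) and its gradient for (unregularized) logistic regression."""
    m = len(y)
    h = sigmoid(X @ theta)
    J = float(-y @ np.log(h) - (1 - y) @ np.log(1 - h)) / m
    grad = X.T @ (h - y) / m
    return J, grad

# theta0, X, y stand for an initial guess and a training set (not shown here):
# res = minimize(cost_and_grad, theta0, args=(X, y), jac=True, method="BFGS")
# theta_opt = res.x
```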

Multi-class Classification

1.Multi-class Classification: One-vs-all



Summary
Train k separate binary classifiers to solve a k-class problem; to classify a new input x, run all k classifiers and pick the class i whose classifier gives the highest $h_\theta^{(i)}(x)$, as sketched below.

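A small prediction sketch (assuming we have already trained one parameter vector per class, stacked as the rows of Theta; the numbers are made up):

```python
import numpy as np

sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))

def predict_one_vs_all(Theta, x):
    """Each row of Theta parameterizes one binary classifier;
    return the class whose classifier outputs the highest probability."""
    return int(np.argmax(sigmoid(Theta @ x)))

Theta = np.array([[ 1.0, -2.0],   # classifier for class 0
                  [ 0.5,  1.0],   # classifier for class 1
                  [-1.0,  0.3]])  # classifier for class 2
x = np.array([1.0, 2.0])          # [bias, feature]
print(predict_one_vs_all(Theta, x))  # -> 1 for this toy input
```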

Solving the Problem of Overfitting

1.The Problem of Overfitting

The problem of overfitting: the model fits the training examples (almost) perfectly but does not generalize well to new examples.
In the course's example figure, the left plot underfits (high bias) and the right plot overfits (high variance), while the middle one fits just right.



Addressing overfitting:

  • Reduce the number of features (manually select which ones to keep, or use a model-selection algorithm)
  • Regularization (covered in the next section): keep all the features, but reduce the magnitude of the parameters θ_j

2.Cost Function
$$\theta_0 + \theta_1 x_1 + \theta_2 x_2 + \theta_3 x_3 + \theta_4 x_4$$

Suppose we penalize $\theta_3$ and $\theta_4$ and make them small; then we effectively force the model to 'simplify' itself, since the corresponding terms contribute very little.


Regularization
The regularized cost function for linear regression:
$$J(\theta) = \frac{1}{2m} \left[ \sum_{i=1}^{m} \big( h_\theta(x^{(i)}) - y^{(i)} \big)^2 + \lambda \sum_{j=1}^{n} \theta_j^2 \right]$$
By convention the regularization sum starts from $\theta_1$, i.e. $\theta_0$ is not penalized (in practice it makes little difference whether $\theta_0$ is included).


$\lambda$ is called the regularization parameter.
If $\lambda$ is set to an extremely large value, then all the $\theta_j$ (for j ≥ 1) are pushed close to 0 and the hypothesis reduces to roughly $h_\theta(x) = \theta_0$, i.e. the model underfits.



3.Regularized Linear Regression



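In the lecture, gradient descent for regularized linear regression leaves $\theta_0$ unregularized and adds a $\frac{\lambda}{m}\theta_j$ term to the gradient for $j \ge 1$. A minimal NumPy sketch of one update step (my own translation; names are illustrative):

```python
import numpy as np

def regularized_gd_step(theta, X, y, alpha, lam):
    """One gradient descent step for regularized linear regression.
    The bias theta_0 is not regularized."""
    m = len(y)
    h = X @ theta                      # linear hypothesis h_theta(x) = theta^T x
    grad = X.T @ (h - y) / m
    grad[1:] += (lam / m) * theta[1:]  # regularize theta_1 ... theta_n only
    return theta - alpha * grad
```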
Adding the regularization term to the Normal Equation:
$$\theta = \big( X^T X + \lambda \cdot L \big)^{-1} X^T y, \qquad L = \mathrm{diag}(0, 1, 1, \dots, 1)$$
The matrix L has dimension (n + 1) × (n + 1).

When m < n, $X^T X$ is not invertible. But after adding the term $\lambda \cdot L$ (with $\lambda > 0$), the matrix becomes invertible.

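A NumPy sketch of this regularized normal equation (my own translation of the formula; using a linear solve rather than an explicit inverse):

```python
import numpy as np

def normal_equation_regularized(X, y, lam):
    """theta = (X^T X + lambda * L)^{-1} X^T y, with L = diag(0, 1, ..., 1)
    so that the bias theta_0 is not penalized."""
    L = np.eye(X.shape[1])   # (n + 1) x (n + 1) identity
    L[0, 0] = 0.0            # do not regularize the bias term
    return np.linalg.solve(X.T @ X + lam * L, X.T @ y)
```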
4.Regularized Logistic Regression
We can regularize logistic regression in a similar way that we regularize linear regression.



Recall that our cost function for logistic regression was:

$$J(\theta) = -\frac{1}{m} \sum_{i=1}^{m} \left[ y^{(i)} \log\big(h_\theta(x^{(i)})\big) + (1 - y^{(i)}) \log\big(1 - h_\theta(x^{(i)})\big) \right]$$

We regularize it by adding a penalty term:

$$J(\theta) = -\frac{1}{m} \sum_{i=1}^{m} \left[ y^{(i)} \log\big(h_\theta(x^{(i)})\big) + (1 - y^{(i)}) \log\big(1 - h_\theta(x^{(i)})\big) \right] + \frac{\lambda}{2m} \sum_{j=1}^{n} \theta_j^2$$

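A NumPy sketch of this regularized cost (my own translation; as before, $\theta_0$ is excluded from the penalty):

```python
import numpy as np

sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))

def regularized_cost(theta, X, y, lam):
    """Regularized logistic regression cost; theta_0 is not penalized."""
    m = len(y)
    h = sigmoid(X @ theta)
    J = float(-y @ np.log(h) - (1 - y) @ np.log(1 - h)) / m
    return J + (lam / (2 * m)) * float(np.sum(theta[1:] ** 2))
```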


Advanced optimization algorithms (such as those mentioned earlier) can be applied to this regularized cost function in the same way.
