Classification vs. Regression
Regression outputs continuous values, while classification outputs only discrete values (0, 1, etc.).
Binary classification problems have output values 0 or 1.
Using linear regression and mapping all predictions greater than 0.5 to 1 and all predictions less than 0.5 to 0 doesn't work well, because classification is not actually a linear function.
Logistic function (Sigmoid)
The hypothesis uses the logistic (sigmoid) function: $h_{\theta}(x) = g(\theta^{T}x)$, where $g(z) = \frac{1}{1 + e^{-z}}$. Its graph is an S-shaped curve that maps any real number into the interval (0, 1).
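A minimal sketch of the sigmoid in NumPy (the function name `sigmoid` is my own choice, not from the notes), showing how it squashes any real input into (0, 1):

```python
import numpy as np

def sigmoid(z):
    """Logistic (sigmoid) function g(z) = 1 / (1 + e^{-z})."""
    return 1.0 / (1.0 + np.exp(-z))

print(sigmoid(0))    # exactly 0.5
print(sigmoid(10))   # close to 1
print(sigmoid(-10))  # close to 0
```

Note that $g(0) = 0.5$, which is what makes 0.5 the natural classification threshold below.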
$h_{\theta}(x)$ gives us the probability of output = 1 given $x$; on the other hand, $1 - h_{\theta}(x)$ gives us the probability of output = 0 given $x$.
When $h_{\theta}(x) \geq 0.5$, output = 1; otherwise output = 0.
Therefore, we can say that when $\theta^{T}x \geq 0$, we have output = 1, otherwise output = 0.
Decision Boundary
A property of the logistic function that separates the region where we predict $h_{\theta}(x) = 1$ from the region where we predict $h_{\theta}(x) = 0$.
The boundary itself is the set of points that yields $h_{\theta}(x) = 0.5$, i.e. where $\theta^{T}x = 0$.
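To make the boundary concrete, here is a small sketch with hypothetical parameters $\theta = (-3, 1, 1)$, for which the boundary is the line $x_1 + x_2 = 3$; any point on that line yields exactly 0.5:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# hypothetical parameters: boundary is -3 + x1 + x2 = 0, i.e. the line x1 + x2 = 3
theta = np.array([-3.0, 1.0, 1.0])

on_boundary = np.array([1.0, 1.5, 1.5])  # intercept term, then x1 + x2 = 3
h = sigmoid(theta @ on_boundary)
print(h)  # 0.5 -- points on the boundary yield exactly 0.5
```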
Cost function
The linear regression cost function won't work, because the logistic function makes it non-convex (it has many local optima).
The cost function for logistic regression is:

$\mathrm{Cost}(h_{\theta}(x), y) = \begin{cases} -\log(h_{\theta}(x)) & \text{if } y = 1 \\ -\log(1 - h_{\theta}(x)) & \text{if } y = 0 \end{cases}$

For y = 1, the function is $-\log(h_{\theta}(x))$: the cost is 0 when $h_{\theta}(x) = 1$ and grows to infinity as $h_{\theta}(x) \to 0$.
For y = 0, the function is $-\log(1 - h_{\theta}(x))$: the cost is 0 when $h_{\theta}(x) = 0$ and grows to infinity as $h_{\theta}(x) \to 1$.
These properties mean confident wrong predictions are penalized heavily, and the resulting optimization problem is convex.
We can further compress the two cases into a single expression:

$\mathrm{Cost}(h_{\theta}(x), y) = -y\log(h_{\theta}(x)) - (1 - y)\log(1 - h_{\theta}(x))$
Then, averaging over all $m$ training examples, the full cost function is:

$J(\theta) = -\frac{1}{m}\sum_{i=1}^{m}\left[y^{(i)}\log(h_{\theta}(x^{(i)})) + (1 - y^{(i)})\log(1 - h_{\theta}(x^{(i)}))\right]$
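The compressed cost over a training set can be sketched in vectorized form. The tiny dataset and parameter values below are hypothetical, chosen only to exercise the function:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def cost(theta, X, y):
    """Vectorized logistic cost: -(1/m) * [y . log(h) + (1-y) . log(1-h)]."""
    m = len(y)
    h = sigmoid(X @ theta)
    return -(y @ np.log(h) + (1 - y) @ np.log(1 - h)) / m

# tiny hypothetical dataset (intercept column of ones included)
X = np.array([[1.0, 2.0], [1.0, -2.0]])
y = np.array([1.0, 0.0])

# with theta = 0, h = 0.5 everywhere, so the cost is log(2) for any labels
print(cost(np.zeros(2), X, y))  # ~0.693
print(cost(np.array([0.0, 1.0]), X, y))  # lower: this theta fits both examples
```

Note the cost with all-zero parameters is $\log 2 \approx 0.693$, a useful sanity check when implementing this from scratch.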