Logistic Regression--Simplified Cost Function and Gradient Descent
1. Cost Function
We can compress our cost function’s two conditional cases into one case:
$$Cost(h_\theta(x),y) = -y\log(h_\theta(x)) - (1-y)\log(1-h_\theta(x))$$

Notice that when y is equal to 1, the second term $(1-y)\log(1-h_\theta(x))$ will be zero and will not affect the result. If y is equal to 0, then the first term $-y\log(h_\theta(x))$ will be zero and will not affect the result.
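As a quick sanity check, the equivalence between the original two-case definition and the compressed form can be verified numerically. This is a minimal sketch; the function names `cost_two_case` and `cost_compressed` are illustrative, not from the course:

```python
import numpy as np

def cost_two_case(h, y):
    # Original piecewise definition of the cost for a single example
    if y == 1:
        return -np.log(h)
    else:  # y == 0
        return -np.log(1 - h)

def cost_compressed(h, y):
    # Single-expression form: one term vanishes depending on y
    return -y * np.log(h) - (1 - y) * np.log(1 - h)

# The two forms agree for every (h, y) combination
for h in (0.1, 0.5, 0.9):
    for y in (0, 1):
        assert np.isclose(cost_two_case(h, y), cost_compressed(h, y))
```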
We can fully write out our entire cost function as follows:
$$J(\theta) = -\frac{1}{m}\sum_{i=1}^{m}\left[y^{(i)}\log(h_\theta(x^{(i)})) + (1-y^{(i)})\log(1-h_\theta(x^{(i)}))\right]$$

A vectorized implementation is:
$$h = g(X\theta)$$

$$J(\theta) = \frac{1}{m}\cdot\left(-y^T\log(h) - (1-y)^T\log(1-h)\right)$$
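The vectorized cost can be sketched in NumPy as follows, where `sigmoid` plays the role of g. The function names are illustrative assumptions, not from the course:

```python
import numpy as np

def sigmoid(z):
    # g(z) = 1 / (1 + e^{-z})
    return 1.0 / (1.0 + np.exp(-z))

def cost(theta, X, y):
    # J(theta) = (1/m) * (-y^T log(h) - (1-y)^T log(1-h)),  h = g(X theta)
    m = len(y)
    h = sigmoid(X @ theta)
    return (1.0 / m) * (-y @ np.log(h) - (1 - y) @ np.log(1 - h))
```

With theta initialized to zeros, h is 0.5 for every example, so the cost evaluates to log(2) regardless of the labels, which is a convenient check of the implementation.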
2. Gradient Descent
Remember that the general form of gradient descent is:
$$\text{Repeat}\ \{\ \theta_j := \theta_j - \alpha\frac{\partial}{\partial\theta_j}J(\theta)\ \}$$

We can work out the derivative part using calculus to get:
$$\text{Repeat}\ \{\ \theta_j := \theta_j - \frac{\alpha}{m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)x_j^{(i)}\ \}$$

Notice that this algorithm is identical to the one we used in linear regression. We still have to simultaneously update all values in theta.
A vectorized implementation is:
$$\theta := \theta - \frac{\alpha}{m}X^T\left(g(X\theta) - \vec{y}\right)$$
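A minimal NumPy sketch of the vectorized update, assuming X already includes the intercept column of ones; the function names and the tiny dataset in the usage note are illustrative assumptions:

```python
import numpy as np

def sigmoid(z):
    # g(z) = 1 / (1 + e^{-z})
    return 1.0 / (1.0 + np.exp(-z))

def gradient_descent(theta, X, y, alpha=0.1, iters=1000):
    # theta := theta - (alpha/m) * X^T (g(X theta) - y)
    # Every component of theta is updated simultaneously in one
    # matrix expression, so no temporary variables are needed.
    m = len(y)
    for _ in range(iters):
        theta = theta - (alpha / m) * (X.T @ (sigmoid(X @ theta) - y))
    return theta
```

For example, on a tiny separable dataset (feature values 0..3, labeled 1 when the feature exceeds 1.5), the learned theta classifies every training example correctly via the rule g(Xθ) ≥ 0.5.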
3. Supplementary
The step-by-step derivation of the derivative above is still being worked out.