Andrew Ng's Machine Learning: My Collection of Missed Questions

Alinawly

Article link: https://blog.csdn.net/Alinawly/article/details/78340739

Week 2

Which of the following are reasons for using feature scaling?

It speeds up gradient descent by making it require fewer iterations to get to a good solution.

It speeds up gradient descent by making each iteration of gradient descent less expensive to compute.
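
The difference between these two statements can be checked with a small experiment. The sketch below runs batch gradient descent for linear regression on the same data twice, once with raw features and once with mean-normalized features. The data, the learning rates, the tolerance, and the iteration cap are all made-up assumptions for illustration; each learning rate was hand-picked to be stable for its own run. Each iteration does the same amount of arithmetic in both runs, so any speedup can only come from needing fewer iterations.

```python
import math


def gradient_descent(X, y, alpha, tol=1e-6, max_iters=20000):
    """Batch gradient descent for linear regression.

    Returns (theta, iterations used). Stops early once every gradient
    component is smaller than tol in absolute value; otherwise stops
    at max_iters.
    """
    m, n = len(X), len(X[0])
    theta = [0.0] * n
    for it in range(1, max_iters + 1):
        # Prediction error for each training example.
        errors = [sum(t * xj for t, xj in zip(theta, row)) - yi
                  for row, yi in zip(X, y)]
        # Gradient of the cost J = (1 / 2m) * sum(error^2).
        grad = [sum(e * row[j] for e, row in zip(errors, X)) / m
                for j in range(n)]
        if max(abs(g) for g in grad) < tol:
            return theta, it
        # Simultaneous update of all parameters.
        theta = [t - alpha * g for t, g in zip(theta, grad)]
    return theta, max_iters


def standardize(column):
    """Mean normalization: subtract the mean, divide by the standard deviation."""
    mean = sum(column) / len(column)
    std = math.sqrt(sum((v - mean) ** 2 for v in column) / len(column))
    return [(v - mean) / std for v in column]


# Hypothetical housing data: size in square feet, bedroom count, price.
sizes = [2104.0, 1600.0, 2400.0, 1416.0, 3000.0, 1985.0, 1534.0, 1427.0]
bedrooms = [3.0, 3.0, 3.0, 2.0, 4.0, 4.0, 3.0, 3.0]
prices = [400.0, 330.0, 369.0, 232.0, 540.0, 300.0, 315.0, 199.0]

# Each row is [1, x1, x2]; the leading 1 is the intercept term.
X_raw = [[1.0, s, b] for s, b in zip(sizes, bedrooms)]
X_scaled = [[1.0, s, b]
            for s, b in zip(standardize(sizes), standardize(bedrooms))]

# Raw features force a tiny learning rate to avoid divergence.
_, iters_raw = gradient_descent(X_raw, prices, alpha=1e-7)
_, iters_scaled = gradient_descent(X_scaled, prices, alpha=0.3)

print("iterations without scaling:", iters_raw)
print("iterations with scaling:", iters_scaled)
```

On this toy data the unscaled run is expected to use up its whole iteration budget without meeting the tolerance, while the scaled run stops much earlier.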

Week 3

2. Suppose you have the following training set, and fit a logistic regression classifier

$h_\theta(x) = g(\theta_0 + \theta_1 x_1 + \theta_2 x_2)$.

Which of the following are true? Check all that apply.

A. Adding polynomial features (e.g., instead using $h_\theta(x) = g(\theta_0 + \theta_1 x_1 + \theta_2 x_2 + \theta_3 x_1^2 + \theta_4 x_1 x_2 + \theta_5 x_2^2)$) could increase how well we can fit the training data.

B. At the optimal value of $\theta$ (e.g., found by fminunc), we will have $J(\theta) \geq 0$.

C. Adding polynomial features (e.g., instead using $h_\theta(x) = g(\theta_0 + \theta_1 x_1 + \theta_2 x_2 + \theta_3 x_1^2 + \theta_4 x_1 x_2 + \theta_5 x_2^2)$) would increase $J(\theta)$ because we are now summing over more terms.

D. If we train gradient descent for enough iterations, for some examples $x^{(i)}$ in the training set it is possible to obtain $h_\theta(x^{(i)}) > 1$.

Answer: A and B.
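
The reasoning behind this answer can be checked numerically. The sketch below is only an illustration: the training set and the parameter vectors are made up (the original training set is not reproduced here), and the cost is assumed to be the usual cross-entropy cost averaged over the examples. It checks three things: the sigmoid output stays between 0 and 1, every term of the cost is non-negative, and a polynomial model with its extra parameters set to zero reproduces the linear model exactly. Since the cost sums over training examples rather than over features, the best polynomial fit can never have a higher training cost than the best linear fit.

```python
import math


def sigmoid(z):
    """Logistic function g(z) = 1 / (1 + e^(-z)), written to avoid overflow."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def hypothesis(theta, x):
    """h_theta(x) = g(theta^T x)."""
    return sigmoid(sum(t * xj for t, xj in zip(theta, x)))


def cost(theta, X, y):
    """Cross-entropy cost J(theta), averaged over the m examples."""
    total = 0.0
    for row, yi in zip(X, y):
        h = hypothesis(theta, row)
        # Both log terms are <= 0, so each summand is >= 0.
        total += -yi * math.log(h) - (1 - yi) * math.log(1 - h)
    return total / len(X)


def add_polynomial_features(row):
    """Map [1, x1, x2] to [1, x1, x2, x1^2, x1*x2, x2^2]."""
    _, x1, x2 = row
    return row + [x1 * x1, x1 * x2, x2 * x2]


# Hypothetical training set: each row is [1, x1, x2].
X = [[1.0, 1.0, 0.5], [1.0, 1.0, 1.5], [1.0, 2.0, 1.0],
     [1.0, 3.0, 1.0], [1.0, 2.0, 2.0], [1.0, 3.0, 3.0]]
y = [0, 0, 0, 1, 1, 1]

# A few arbitrary parameter vectors, not fitted values.
thetas = [[0.0, 0.0, 0.0], [-6.0, 1.0, 1.0], [3.0, -2.0, 0.5]]
outputs = [hypothesis(t, row) for t in thetas for row in X]
costs = [cost(t, X, y) for t in thetas]

# Same parameters on the polynomial model, extra parameters set to zero.
X_poly = [add_polynomial_features(row) for row in X]
theta_lin = [-6.0, 1.0, 1.0]
cost_linear = cost(theta_lin, X, y)
cost_poly_zero = cost(theta_lin + [0.0, 0.0, 0.0], X_poly, y)

print("all outputs in (0, 1):", all(0.0 < h < 1.0 for h in outputs))
print("all costs >= 0:", all(c >= 0.0 for c in costs))
print("linear cost equals zero-padded polynomial cost:",
      abs(cost_linear - cost_poly_zero) < 1e-12)
```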

3. For logistic regression, the gradient is given by $\frac{\partial}{\partial \theta_j} J(\theta) = \frac{1}{m} \sum_{i=1}^{m} \left( h_\theta(x^{(i)}) - y^{(i)} \right) x_j^{(i)}$. Which of these is a correct gradient descent update for logistic regression with a learning rate of $\alpha$? Check all that apply.