machine learning error correction

最新推荐文章于 2018-05-08 16:19:04 发布

huazhenrea

最新推荐文章于 2018-05-08 16:19:04 发布

阅读量7.1k

点赞数

分类专栏：作业纠错机器学习文章标签：机器学习 coursera 作业纠错

本文链接：https://blog.csdn.net/huazhenrea/article/details/52567041

版权

机器学习同时被 2 个专栏收录

2 篇文章 0 订阅

订阅专栏

作业纠错

1 篇文章 0 订阅

订阅专栏

1, Suppose you have the following training set, and fit a logistic regression classifier

hθ(x)=g(θ0+θ1x1+θ2x2)

Which of the following are true? Check all that apply.

J(θ) will be a convex function, so gradient descent should converge to the global minimum.

convex function is a quandratic function.

Adding polynomial features (e.g., instead using hθ(x)=g(θ0+θ1x1+θ2x2+θ3x21+θ4x1x2+θ5x22) ) could increase how well we can fit the training data.

The positive and negative examples cannot be separated using a straight line. So, gradient descent will fail to converge.

[EX] the positive and negative examples cannot be separeted using a straight line, but when using the polynomial models , the gradient descent will still effective to converge

Because the positive and negative examples cannot be separated using a straight line, linear regression will perform as well as logistic regression on this data.

[EX ]linear regression often do not work well in classification problems.

2. Which of the following statements are true? Check all that apply.

The cost function J(θ) for logistic regression trained with m≥1 examples is always greater than or equal to zero.

For logistic regression, sometimes gradient descent will converge to a local minimum (and fail to find the global minimum). This is the reason we prefer more advanced optimization algorithms such as fminunc (conjugate gradient/BFGS/L-BFGS/etc).

[] not for this reason, those three ads faster than gradient descent and you don't need to manully pick alpha.

The one-vs-all technique allows you to use logistic regression for problems in which each y(i) comes from a fixed, discrete set of values.