Andrew Ng Machine Learning Week 3 Review Answers: Logistic Regression

1.

Suppose that you have trained a logistic regression classifier, and it outputs on a new example x a prediction hθ(x) = 0.4. This means (check all that apply):

[Y] Our estimate for P(y=0|x;θ) is 0.6.

[Y] Our estimate for P(y=1|x;θ) is 0.4.

Our estimate for P(y=1|x;θ) is 0.6.

Our estimate for P(y=0|x;θ) is 0.4.
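The arithmetic behind the two checked options is that hθ(x) is read as P(y=1|x;θ), and the two class probabilities sum to 1. A minimal sketch of that reading, where the function name `class_probabilities` is a hypothetical helper and not part of the course material:

```python
def class_probabilities(h):
    """Return (P(y=1|x;theta), P(y=0|x;theta)) for a hypothesis output h."""
    if not 0.0 <= h <= 1.0:
        raise ValueError("h must lie in [0, 1]")
    # The hypothesis output is the estimated probability of the positive class;
    # the negative class gets whatever probability mass is left.
    return h, 1.0 - h

p_y1, p_y0 = class_probabilities(0.4)
print(p_y1, p_y0)
```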

2.

Suppose you have a training set (figure not shown) and fit a logistic regression classifier hθ(x)=g(θ0+θ1x1+θ2x2).

Which of the following are true? Check all that apply.

[Y] J(θ) will be a convex function, so gradient descent should converge to the global minimum.

[Y] Adding polynomial features (e.g., instead using hθ(x)=g(θ0+θ1x1+θ2x2+θ3x1²+θ4x1x2+θ5x2²) ) could increase how well we can fit the training data.

The positive and negative examples cannot be separated using a straight line. So, gradient descent will fail to converge.

Because the positive and negative examples cannot be separated using a straight line, linear regression will perform as well as logistic regression on this data.
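The claim about polynomial features can be checked on synthetic data. The sketch below is only an illustration under stated assumptions: the data set (positives inside a circle), the learning rate, the iteration count, and the helper names `fit_logistic` and `accuracy` are all made up for this example and do not come from the quiz.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def fit_logistic(X, y, alpha=1.0, iters=5000):
    """Batch gradient descent on the logistic regression cost."""
    theta = np.zeros(X.shape[1])
    m = len(y)
    for _ in range(iters):
        grad = X.T @ (sigmoid(X @ theta) - y) / m
        theta -= alpha * grad
    return theta

def accuracy(X, y, theta):
    """Fraction of training examples classified correctly at threshold 0.5."""
    return np.mean((sigmoid(X @ theta) >= 0.5) == (y == 1))

# Assumed toy data: positives lie inside a circle, so no straight line separates them.
rng = np.random.default_rng(0)
x1, x2 = rng.uniform(-1, 1, size=(2, 200))
y = (x1**2 + x2**2 < 0.5).astype(float)

ones = np.ones_like(x1)
X_lin = np.column_stack([ones, x1, x2])                             # theta0..theta2
X_poly = np.column_stack([ones, x1, x2, x1**2, x1 * x2, x2**2])     # theta0..theta5

acc_lin = accuracy(X_lin, y, fit_logistic(X_lin, y))
acc_poly = accuracy(X_poly, y, fit_logistic(X_poly, y))
print(acc_lin, acc_poly)
```

On this kind of data the quadratic feature set should fit the training examples noticeably better than the linear one, and gradient descent still runs to a stable answer in the linear case even though the classes are not linearly separable.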


3.

For logistic regression, the gradient is given by $\frac{\partial}{\partial\theta_j} J(\theta) = \frac{1}{m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)x_j^{(i)}$. Which of these is a correct gradient descent update for logistic regression with a learning rate of $\alpha$? Check all that apply.

$\theta_j := \theta_j - \alpha\frac{1}{m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)x^{(i)}$ (simultaneously update for all $j$).

[Y] $\theta_j := \theta_j - \alpha\frac{1}{m}\sum_{i=1}^{m}\left(\frac{1}{1+e^{-\theta^T x^{(i)}}} - y^{(i)}\right)x_j^{(i)}$ (simultaneously update for all $j$).

$\theta := \theta - \alpha\frac{1}{m}\sum_{i=1}^{m}\left(\theta^T x - y^{(i)}\right)x^{(i)}$.

[Y] $\theta_j := \theta_j - \alpha\frac{1}{m}\sum_{i=1}^{m}\left(h_\theta(x^{(i)}) - y^{(i)}\right)x_j^{(i)}$ (simultaneously update for all $j$).
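The per-component update with the sigmoid hypothesis can be written as one vectorized step. This is a minimal NumPy sketch, not code from the course: the function name `gradient_step` and the four-example data set are assumptions made for illustration.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def gradient_step(theta, X, y, alpha):
    """One simultaneous gradient descent update of every theta_j.

    X is (m, n+1) with a leading column of ones; y is (m,) with 0/1 labels.
    """
    m = X.shape[0]
    h = sigmoid(X @ theta)        # h_theta(x^(i)) for every example
    grad = X.T @ (h - y) / m      # j-th entry: (1/m) * sum_i (h_i - y_i) * x_j^(i)
    return theta - alpha * grad   # built from the old theta only, so the update is simultaneous

# Hypothetical toy data: one feature plus the intercept column.
X = np.array([[1.0, 0.0], [1.0, 1.0], [1.0, 2.0], [1.0, 3.0]])
y = np.array([0.0, 0.0, 1.0, 1.0])
theta = gradient_step(np.zeros(2), X, y, alpha=0.1)
print(theta)
```

Starting from θ = 0, every prediction is 0.5, so the intercept gradient cancels out and only the feature weight moves on the first step.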

4.

Which of the following statements are true? Check all that apply.

[Y] The cost function J(θ) for logistic regression trained with m ≥ 1 examples is always greater than or equal to zero.

For logistic regression, sometimes gradient descent will converge to a local minimum (and fail to find the global minimum). This is the reason we prefer more advanced optimization algorithms such as fminunc (conjugate gradient/BFGS/L-BFGS/etc).

Since we train one classifier when there are two classes, we train two classifiers when there are three classes (and we do one-vs-all classification).

[Y] The one-vs-all technique allows you to use logistic regression for problems in which each y^(i) comes from a fixed, discrete set of values.
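One-vs-all trains one binary classifier per class, so three classes need three classifiers, not two. The sketch below illustrates this on assumed synthetic data (three well-separated clusters); the helper names `fit_binary`, `fit_one_vs_all`, and `predict`, along with the learning rate and iteration count, are hypothetical choices for this example.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def fit_binary(X, y, alpha=0.1, iters=2000):
    """Plain batch gradient descent for a single binary logistic classifier."""
    theta = np.zeros(X.shape[1])
    for _ in range(iters):
        theta -= alpha * X.T @ (sigmoid(X @ theta) - y) / len(y)
    return theta

def fit_one_vs_all(X, labels):
    """Train one classifier per class, each treating its class as y=1 and the rest as y=0."""
    classes = np.unique(labels)
    thetas = np.array([fit_binary(X, (labels == c).astype(float)) for c in classes])
    return classes, thetas

def predict(X, classes, thetas):
    """Pick the class whose classifier reports the highest probability."""
    return classes[np.argmax(sigmoid(X @ thetas.T), axis=1)]

# Assumed toy data: three clusters at the corners of a triangle.
rng = np.random.default_rng(0)
centers = np.array([[0.0, 0.0], [5.0, 0.0], [0.0, 5.0]])
labels = np.repeat([0, 1, 2], 30)
points = centers[labels] + rng.normal(scale=0.5, size=(90, 2))
X = np.column_stack([np.ones(90), points])

classes, thetas = fit_one_vs_all(X, labels)
train_acc = np.mean(predict(X, classes, thetas) == labels)
print(thetas.shape, train_acc)
```

Each row of `thetas` holds the parameters of one classifier, so the number of rows equals the number of classes.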

5.

Suppose you train a logistic classifier hθ(x)=g(θ0+θ1x1+θ2x2). Suppose θ0=6, θ1=0, θ2=1. Which of the following figures represents the decision boundary found by your classifier?

Figure:

Figure:

Figure:

[Y]Figure:
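Since the figures are not reproduced here, the boundary can be worked out directly: the classifier predicts y=1 when hθ(x) ≥ 0.5, which is the same as θ0+θ1x1+θ2x2 ≥ 0. The sketch below uses the parameter values exactly as they are written in the question text above; the function name `predict` is a hypothetical helper.

```python
import math

def predict(theta, x1, x2):
    """Return 1 when h_theta(x) >= 0.5, which is equivalent to theta^T x >= 0."""
    z = theta[0] + theta[1] * x1 + theta[2] * x2
    h = 1.0 / (1.0 + math.exp(-z))
    return 1 if h >= 0.5 else 0

theta = (6.0, 0.0, 1.0)  # values as written in the question text

# theta1 = 0, so x1 has no influence: the boundary is a horizontal line in the (x1, x2) plane.
# With these values the boundary is 6 + x2 = 0, and y=1 is predicted on the side where 6 + x2 >= 0.
for x2 in (-7.0, -6.0, -5.0):
    print(x2, predict(theta, 0.0, x2))
```

A zero coefficient on a feature always makes the boundary parallel to that feature's axis, and the sign of the remaining coefficient decides which side is labelled y=1.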

