Coursera Machine Learning 第六周 quiz Machine Learning System Design

最新推荐文章于 2021-01-20 22:14:26 发布

OovEver

最新推荐文章于 2021-01-20 22:14:26 发布

阅读量1.2w

点赞数 2

分类专栏： Machine Learning 文章标签： Machine Learning Coursera Andrew Ng week6 SystemDesign

本文链接：https://blog.csdn.net/mupengfei6688/article/details/53141212

版权

Machine Learning 专栏收录该内容

26 篇文章 7 订阅

订阅专栏

有用就点个赞吧

1
point

You are working on a spam classification system using regularized logistic regression. "Spam" is a positive class (y = 1) and "not spam" is the negative class (y = 0). You have trained your classifier and there are m = 1000 examples in the cross-validation set. The chart of predicted class vs. actual class is:

	Actual Class: 1	Actual Class: 0
Predicted Class: 1	85	890
Predicted Class: 0	15	10

For reference:

Accuracy = (true positives + true negatives) / (total examples)
Precision = (true positives) / (true positives + false positives)
Recall = (true positives) / (true positives + false negatives)
F1 score = (2 * precision * recall) / (precision + recall)

What is the classifier's accuracy (as a value from 0 to 1)?

Enter your answer in the box below. If necessary, provide at least two values after the decimal point.

答案 0.095

1
point

Suppose a massive dataset is available for training a learning algorithm. Training on a lot of data is likely to give good performance when two of the following conditions hold true.

Which are the two?

答案AB

Our learning algorithm is able to

represent fairly complex functions (for example, if we

train a neural network or other model with a large

number of parameters).

A human expert on the application domain

can confidently predict y when given only the features x

(or more generally, if we have some way to be confident

that x contains sufficient information to predict y

accurately).

When we are willing to include high

order polynomial features of x (such as x21 , x22 ,

x1x2 , etc.).

The classes are not too skewed.

1
point

Suppose you have trained a logistic regression classifier which is outputing hθ(x) .

Currently, you predict 1 if hθ(x)≥threshold , and predict 0 if hθ(x)<threshold , where currently the threshold is set to 0.5.

Suppose you decrease the threshold to 0.1. Which of the following are true? Check all that apply.

答案D

The classifier is likely to have unchanged precision and recall, and

thus the same F1 score.

The classifier is likely to have unchanged precision and recall, but

higher accuracy.

The classifier is likely to now have lower recall.

The classifier is likely to now have lower precision.

1
point

Suppose you are working on a spam classifier, where spam

emails are positive examples ( y=1 ) and non-spam emails are

negative examples ( y=0 ). You have a training set of emails

in which 99% of the emails are non-spam and the other 1% is

spam. Which of the following statements are true? Check all

that apply.

答案BCD

If you always predict non-spam (output

y=0 ), your classifier will have 99% accuracy on the

training set, but it will do much worse on the cross

validation set because it has overfit the training

data.

A good classifier should have both a

high precision and high recall on the cross validation

set.

If you always predict non-spam (output

y=0 ), your classifier will have 99% accuracy on the

training set, and it will likely perform similarly on

the cross validation set.

If you always predict non-spam (output

y=0 ), your classifier will have an accuracy of

99%.

1
point

Which of the following statements are true? Check all that apply.

答案AD

The "error analysis" process of manually

examining the examples which your algorithm got wrong

can help suggest what are good steps to take (e.g.,

developing new features) to improve your algorithm's

performance.

It is a good idea to spend a lot of time

collecting a large amount of data before building

your first version of a learning algorithm.

If your model is underfitting the

training set, then obtaining more data is likely to

help.

After training a logistic regression

classifier, you must use 0.5 as your threshold

for predicting whether an example is positive or

negative.

Using a very large training set

makes it unlikely for model to overfit the training

data.

OovEver

关注

2
点赞
踩
0

收藏

觉得还不错? 一键收藏
3
评论
Coursera Machine Learning 第六周 quiz Machine Learning System Design

有用就点个赞吧1point1. You are working on a spam classification system using regularized logistic regression. "Spam" is a positive class (y = 1) and "not spam" is the negative class
复制链接

扫一扫

专栏目录