Coursera Machine Learning 第十一周 quiz Application: Photo OCR

1
point
1. 

Suppose you are running a sliding window detector to find

text in images. Your input images are 1000x1000 pixels. You

will run your sliding windows detector at two scales, 10x10

and 20x20 (i.e., you will run your classifier on lots of 10x10

patches to decide if they contain text or not; and also on

lots of 20x20 patches), and you will "step" your detector by 2

pixels each time. About how many times will you end up

running your classifier on a single 1000x1000 test set image?

答案D

250,000

100,000

1,000,000

500,000

1
point
2. 

Suppose that you just joined a product team that has been

developing a machine learning application, using  m=1,000

training examples. You discover that you have the option of

hiring additional personnel to help collect and label data.

You estimate that you would have to pay each of the labellers

$10 per hour, and that each labeller can label 4 examples per

minute. About how much will it cost to hire labellers to

label 10,000 new training examples?

答案C

$600

$250

$400

$10,000

1
point
3. 

What are the benefits of performing a ceiling analysis? Check all that apply.

答案CD

If we have a low-performing component, the ceiling analysis can tell us if that component has a high bias problem or a high variance problem.

It is a way of providing additional training data to the algorithm.

It helps us decide on allocation of resources in terms of which component in a machine learning pipeline to spend more effort on.

It can help indicate that certain components of a system might not be worth a significant amount of work improving, because even if it had perfect performance its impact on the overall system may be small.

1
point
4. 

Suppose you are building an object classifier, that takes as input an image, and recognizes that image as either containing a car ( y=1 ) or not ( y=0 ). For example, here are a positive example and a negative example:

After carefully analyzing the performance of your algorithm, you conclude that you need more positive ( y=1 ) training examples. Which of the following might be a good way to get additional positive examples?

答案A

Apply translations, distortions, and rotations to the images already in your training set.

Select two car images and average them to make a third example.

Take a few images from your training set, and add random, gaussian noise to every pixel.

Make two copies of each image in the training set; this immediately doubles your training set size.

1
point
5. 

Suppose you have a PhotoOCR system, where you have the following pipeline:

You have decided to perform a ceiling analysis on this system, and find the following:

Which of the following statements are true?

答案AB

If the text detection system was trained using gradient descent, running gradient descent for more iterations is unlikely to help much.

If we conclude that the character recognition's errors are mostly due to the character recognition system having high variance, then it may be worth significant effort obtaining additional training data for character recognition.

We should dedicate significant effort to collecting additional training data for the text detection system.

The least promising component to work on is the character recognition system, since it is already obtaining 100% accuracy.

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值