Andrew Ng's Machine Learning, Week 11: Application: Photo OCR


1.

Suppose you are running a sliding window detector to find text in images. Your input images are 1000x1000 pixels. You will run your sliding windows detector at two scales, 10x10 and 20x20 (i.e., you will run your classifier on lots of 10x10 patches to decide if they contain text or not; and also on lots of 20x20 patches), and you will "step" your detector by 2 pixels each time. About how many times will you end up running your classifier on a single 1000x1000 test set image?

1,000,000 

250,000 

100,000 

500,000 
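The estimate can be checked with a few lines of arithmetic. The sketch below is only illustrative: the helper name `window_positions` is hypothetical, and it assumes windows must fit entirely inside the image (no padding), which the question does not state.

```python
def window_positions(image_size: int, window_size: int, step: int) -> int:
    """Count placements of a square window on a square image.

    Assumes the window must lie fully inside the image (no padding).
    """
    per_axis = (image_size - window_size) // step + 1
    return per_axis * per_axis


image_size, step = 1000, 2

# One count per scale used in the question: 10x10 and 20x20 windows.
counts = {w: window_positions(image_size, w, step) for w in (10, 20)}
total = sum(counts.values())

print(counts)  # roughly 250,000 placements per scale
print(total)   # a little under 500,000 in total
```

Each scale gives about 500 x 500 = 250,000 placements, so two scales give roughly 500,000 classifier runs.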

2.

Suppose that you just joined a product team that has been developing a machine learning application, using m=1,000 training examples. You discover that you have the option of hiring additional personnel to help collect and label data. You estimate that you would have to pay each of the labellers $10 per hour, and that each labeller can label 4 examples per minute. About how much will it cost to hire labellers to label 10,000 new training examples?

$600 

$10,000 

$250 

$400 
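The cost is a unit conversion from examples to minutes to hours to dollars. A minimal sketch of that arithmetic, using only the figures given in the question (the variable names are my own):

```python
examples = 10_000          # new training examples to label
examples_per_minute = 4    # labelling speed of one labeller
hourly_rate = 10.0         # dollars per labeller-hour

# Total labelling time is the same whether one person or many do the work.
minutes = examples / examples_per_minute
hours = minutes / 60
cost = hours * hourly_rate

print(f"{hours:.1f} hours, about ${cost:.0f}")
```

This comes to a little over 41 labeller-hours, so the closest option is $400.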

3.

What are the benefits of performing a ceiling analysis? Check all that apply.

It helps us decide on allocation of resources in terms of which component in a machine learning pipeline to spend more effort on. 

It can help indicate that certain components of a system might not be worth a significant amount of work improving, because even if it had perfect performance its impact on the overall system may be small. 

It is a way of providing additional training data to the algorithm. 

If we have a low-performing component, the ceiling analysis can tell us if that component has a high bias problem or a high variance problem. 
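To make the idea concrete, here is a rough sketch of how ceiling analysis numbers are usually read. The stage names and accuracies below are invented for illustration and are not taken from the quiz; `ceiling_gains` is a hypothetical helper. Each entry is the overall system accuracy after that stage, and all stages before it, have been replaced with ground-truth output.

```python
def ceiling_gains(baseline, accuracy_with_perfect_stage):
    """Return the overall-accuracy gain attributable to perfecting each stage.

    accuracy_with_perfect_stage: ordered (stage, accuracy) pairs, where the
    accuracy is measured with that stage and all earlier ones made perfect.
    """
    gains = {}
    previous = baseline
    for stage, accuracy in accuracy_with_perfect_stage:
        gains[stage] = round(accuracy - previous, 4)
        previous = accuracy
    return gains


# Hypothetical numbers, for illustration only.
baseline = 0.70
steps = [
    ("text detection", 0.72),
    ("character segmentation", 0.82),
    ("character recognition", 1.00),
]

gains = ceiling_gains(baseline, steps)
most_promising = max(gains, key=gains.get)

print(gains)
print(most_promising)
```

A stage with a small gain is unlikely to repay much engineering effort, even if it could be made perfect. Note that the analysis says nothing about whether a weak stage suffers from high bias or high variance.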

4.

Suppose you are building an object classifier that takes an image as input and classifies it as either containing a car ( y=1 ) or not ( y=0 ). For example, here are a positive example and a negative example:

After carefully analyzing the performance of your algorithm, you conclude that you need more positive ( y=1 ) training examples. Which of the following might be a good way to get additional positive examples?

Mirror your training images across the vertical axis (so that a left-facing car now becomes a right-facing one). 

Take a few images from your training set, and add random Gaussian noise to every pixel. 

Take a training example and set a random subset of its pixels to 0 to generate a new example. 

Select two car images and average them to make a third example. 
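Mirroring is simple to implement. The sketch below assumes an image is stored as a plain list of pixel rows (a real pipeline would more likely use an array library); `mirror_horizontally` is a hypothetical helper name.

```python
def mirror_horizontally(image):
    """Flip an image left-to-right, i.e. mirror it across the vertical axis.

    The image is assumed to be a list of rows, each row a list of pixels.
    """
    return [row[::-1] for row in image]


# Tiny 2x3 "image" whose pixel values just mark their positions.
image = [
    [0, 1, 2],
    [3, 4, 5],
]

mirrored = mirror_horizontally(image)
print(mirrored)

# The label is unchanged: a mirrored car is still a car (y=1).
augmented_examples = [(image, 1), (mirrored, 1)]
```

The mirrored image is a plausible new photograph of a car, which is why this form of augmentation yields useful positive examples.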

5.

Suppose you have a PhotoOCR system, where you have the following pipeline:

You have decided to perform a ceiling analysis on this system, and find the following:

Which of the following statements are true?

If the text detection system was trained using gradient descent, running gradient descent for more iterations is unlikely to help much. 

If we conclude that the character recognition's errors are mostly due to the character recognition system having high variance, then it may be worth significant effort obtaining additional training data for character recognition. 

We should dedicate significant effort to collecting additional training data for the text detection system. 

The least promising component to work on is the character recognition system, since it is already obtaining 100% accuracy.



Answers: 1. D  2. D  3. AB  4. A  5. AB
