NG机器学习week11 Application: Photo OCR

最新推荐文章于 2023-08-24 11:17:33 发布

ncst

最新推荐文章于 2023-08-24 11:17:33 发布

阅读量1.9k

点赞数

分类专栏：机器学习文章标签： class cnn c语言 github it

本文链接：https://blog.csdn.net/sinat_32547403/article/details/76982355

版权

机器学习专栏收录该内容

46 篇文章 2 订阅

订阅专栏

1。

Suppose you are running a sliding window detector to find

text in images. Your input images are 1000x1000 pixels. You

will run your sliding windows detector at two scales, 10x10

and 20x20 (i.e., you will run your classifier on lots of 10x10

patches to decide if they contain text or not; and also on

lots of 20x20 patches), and you will "step" your detector by 2

pixels each time. About how many times will you end up

running your classifier on a single 1000x1000 test set image?

1,000,000

250,000

100,000

500,000

1
point

2。

Suppose that you just joined a product team that has been

developing a machine learning application, using m=1,000

training examples. You discover that you have the option of

hiring additional personnel to help collect and label data.

You estimate that you would have to pay each of the labellers

$10 per hour, and that each labeller can label 4 examples per

minute. About how much will it cost to hire labellers to

label 10,000 new training examples?

$600

$10,000

$250

$400

1
point

3。

What are the benefits of performing a ceiling analysis? Check all that apply.

It helps us decide on allocation of resources in terms of which component in a machine learning pipeline to spend more effort on.

It can help indicate that certain components of a system might not be worth a significant amount of work improving, because even if it had perfect performance its impact on the overall system may be small.

It is a way of providing additional training data to the algorithm.

If we have a low-performing component, the ceiling analysis can tell us if that component has a high bias problem or a high variance problem.

1
point

4。

Suppose you are building an object classifier, that takes as input an image, and recognizes that image as either containing a car ( y=1 ) or not ( y=0 ). For example, here are a positive example and a negative example:

After carefully analyzing the performance of your algorithm, you conclude that you need more positive ( y=1 ) training examples. Which of the following might be a good way to get additional positive examples?

Mirror your training images across the vertical axis (so that a left-facing car now becomes a right-facing one).

Take a few images from your training set, and add random, gaussian noise to every pixel.

Take a training example and set a random subset of its pixel to 0 to generate a new example.

Select two car images and average them to make a third example.

1
point

5。

Suppose you have a PhotoOCR system, where you have the following pipeline:

You have decided to perform a ceiling analysis on this system, and find the following:

Which of the following statements are true?

If the text detection system was trained using gradient descent, running gradient descent for more iterations is unlikely to help much.

If we conclude that the character recognition's errors are mostly due to the character recognition system having high variance, then it may be worth significant effort obtaining additional training data for character recognition.

We should dedicate significant effort to collecting additional training data for the text detection system.

The least promising component to work on is the character recognition system, since it is already obtaining 100% accuracy.

答案：1.D 2.D3.AB 4.A5.AB