Coursera 吴恩达机器学习课程week1 introduction测验quiz

主要总结题型
Q1:主要是T,P,E的概念选择
A computer program is said to learn from experience E with respect to some task T and some performance measure P if its performance on T, as measured by P, improves with experience E.

Answer:
T := The weather prediction task.
P := The probability of it correctly predicting a future date’s weather.
E := The process of the algorithm examining a large amount of historical weather data.

Q2: 分类和回归的判断问题
Classification is appropriate when we are trying to predict one of a small number of discrete-valued outputs, such as whether it will rain (which we might designate as class 0), or not (say class 1).
分类:当我们试图预测一小部分离散值输出中的一个时。

Regression is appropriate when we are trying to predict a continuous-valued output, since as the price of a stock (similar to the housing prices example in the lectures).
回归:当我们试图预测一个连续值的输出时。

Q3:machine learning定义
Which of these is a reasonable definition of machine learning?
Answer:
Machine learning is the field of study that gives computers the ability to learn without being explicitly programmed.

测试题:链接参考

  1. You’re running a company, and you want to develop learning algorithms to address each of two problems. Problem 1:You have a large inventory of identical items. You want to predict how many of these items will sell over the next 3 months.
    Problem 2: You’d like software to examine individual customer accounts, and for each account decide if it has been hacked/compromised. Should you treat these as classification or as regression problems?

    A.Treat both as classification problems.

    B.Treat problem 1 as a classification problem, problem 2 as a regression problem.

    C.Treat problem 1 as a regression problem, problem 2 as a classification problem.

    D.Treat both as regression problems.

    正确答案:

    选择C。连续值为回归问题,离散值为分类问题

  2. Of the following examples, which would you address using an unsupervised learning algorithm? (Check all that apply.)

    A.Given email labeled as spam/not spam, learn a spam filter.
    B.Given a set of news articles found on the web, group them into sets of articles about the same stories.
    C.Given a database of customer data, automatically discover market segments and group customers into different market segments.
    D.Given a dataset of patients diagnosed as either having diabetes or not, learn to classify new patients as having diabetes or not.

    正确答案:

    选B,D。

    无监督学习,没有标定好的数据集,并且自动发现类别。
    给定一个数据库的客户数据,自动发现和集团客户市场的段段到不同的市场。

参考题quiz

  1. Suppose you are working on weather prediction, and use a learning algorithm to predict tomorrow’s temperature (in degrees Centigrade/Fahrenheit). Would you treat this as a classification or a regression problem?
    A. Classification

    B. Regression
    Answer:这是一道监督学习的题目,根据以往的天气,预测将来的天气,由于预测的温度是连续的值,故为回归问题,如为离散的值,则为分类问题,如预测第二天(多云,晴,阴,雨)此为分类。故此题选择回归B。

  2. Suppose you are working on stock market prediction. You would like to predict whether or not a certain company will win a patent infringement lawsuit (by training on data of companies that had to defend against similar lawsuits). Would you treat this as a classification or a regression problem?

    A. Regression

    B. Classification
    Answer:这是一道监督学习的题目,根据以往股票数据预测公司胜诉的几率,由于此题中公司只有胜诉或者败诉两种情况,故属于分类问题选择B。

  3. Some of the problems below are best addressed using a supervised learning algorithm, and the others with an unsupervised learning algorithm. Which of the following would you apply supervised learning to? (Select all that apply.) In each case, assume some appropriate dataset is available for your algorithm to learn from.
    A. Examine a large collection of emails that are known to be spam email, to discover if there are sub-types of spam mail.

    B. Given historical data of children’s ages and heights, predict children’s height as a function of their age.

    C. Given 50 articles written by male authors, and 50 articles written by female authors, learn to predict the gender of a new manuscript’s author (when the identity of this author is unknown).

    D. Take a collection of 1000 essays written on the US Economy, and find a way to automatically group these essays into a small number of groups of essays that are somehow “similar” or “related”.
    Answer:第一个根据垃圾邮件发现垃圾邮件的子类型,没有给正确答案属于分类,第四个搜集1000篇文章,没有给文章的类型,以及应该怎么分类,故属于非监督学习。PS:界定监督学习与非监督学习,看给的数据是否有想要的正确答案即可选择BC。

  • 1
    点赞
  • 2
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值