这些是我自己做题的思路,但是吧,本人能力和水平有限,存在理解有问题却不自知的情况,望大家帮忙指正,思路仅供参考哈
第一题
A computer program is said to learn from experience E with respect to some task T and some performance measure P if its performance on T, as measured by P, improves with experience E. Suppose we feed a learning algorithm a lot of historical weather data, and have it learn to predict weather. In this setting, what is T?
A:The weather prediction task.
B:None of these.
C:The process of the algorithm examining a large amount of historical weather data.
D:The probability of it correctly predicting a future date’s weather.
选A。C为历史经验E,D为性能度量P
第二题
The amount of rain that falls in a day is usually measured in either millimeters (mm) or inches. Suppose you use a learning algorithm to predict how much rain will fall tomorrow. Would you treat this as a classification or a regression problem?
A:Classification
B:Regression
选B。问的多少降雨,我个人理解为一个回归问题,通过前几天的数据,预测明天的雨量。
第三题
Suppose you are working on stock market prediction. You would like to predict whether or not a certain company will declare bankruptcy within the next 7 days (by training on data of similar companies that had previously been at risk of bankruptcy). Would you treat this as a classification or a regression problem?
A:Classification
B:Regression
选A。通过对以前数据的对比分析,分析一个公司在接下来的7天要么破产,要么不破产,应该是一个分类问题。
第三题
Some of the problems below are best addressed using a supervised learning algorithm, and the others with an unsupervised learning algorithm. Which of the following would you apply supervised learning to? (Select all that apply.) In each case, assume some appropriate dataset is available for your algorithm to learn from.
A:Examine the statistics of two football teams, and predict which team will win tomorrow’s match (given historical data of teams’ wins/losses to learn from).
B:Examine a large collection of emails that are known to be spam email, to discover if there are sub-types of spam mail.
C:Given genetic (DNA) data from a person, predict the odds of him/her developing diabetes over the next 10 years.
D:Take a collection of 1000 essays written on the US Economy, and find a way to automatically group these essays into a small number of groups of essays that are somehow “similar” or “related”.
选A,B。
A球队往期比赛的数据都有记录,会有胜负标签
B垃圾邮件分类,有对邮件分类的标签
C没想通
D无监督学习,自动找出数据结构
答案错误,B可以肯定是对的,那就是BC了,原因没想通,A也迷惑了
第五题
Which of these is a reasonable definition of machine learning?
A:Machine learning is the field of allowing robots to act intelligently.
B:Machine learning is the field of study that gives computers the ability to learn without being explicitly programmed.
C:Machine learning is the science of programming computers.
D:Machine learning learns from labeled data.
选B。机器学习就是无需明确编程即可自己学习的研究。