Andrew Ng Machine Learning 第九周

最新推荐文章于 2024-06-26 09:48:29 发布

未知丶丶

最新推荐文章于 2024-06-26 09:48:29 发布

阅读量944

点赞数 1

分类专栏：机器学习文章标签：机器学习

本文链接：https://blog.csdn.net/qq_43310834/article/details/85265825

版权

机器学习专栏收录该内容

10 篇文章 0 订阅

订阅专栏

前言

网易云课堂（双语字幕，不卡）：https://study.163.com/course/courseMain.htm?courseId=1004570029
Coursera：https://www.coursera.org/learn/machine-learning
本人初学者，先在网易云课堂上看网课，再去Coursera上做作业，开博客以记录，文章中引用图片皆为课程中所截。

异常检测

1.目标动机

在这里插入图片描述
Tips：判断新的x_test是否是异常数据

Tips：由现有的训练集拟合出一个p(x)模型，将p(x_test)代入模型，将结果与ε对比，若小于ε，即说明新的点落在该模型的概率过小，即为异常

2.高斯分布（正态分布）

在这里插入图片描述

3.算法

Tips：假设p(x)拟合为高斯分布
在这里插入图片描述
Tips：对于每个特征x，求出相应的高斯分布参数μ和σ，最后将每个p(x_i)相乘，得出最后的p(x)

4.开发和评估异常检测

在这里插入图片描述

Tips：以上为假设情况，假设分配情况如上

Tips：在训练集上模型p(x)，在交叉集或者测试集上测试当前p(x)情况

Tips：评估方法在第六周笔记，同样ε也用这种方法来决定

5.异常检测VS监督学习

在这里插入图片描述
Tips：简单来说，异常检测情况是当正样本很少负样本很大的时候或者出现异常的情况很多的时候使用

6.选择要使用的功能

(1)特征高斯化

在这里插入图片描述

(2)误差分析

在这里插入图片描述
Tips：用从训练集中得到的p(x)去在交叉集上验证，将其中验证有误差的样本人为的挑选出来，并且根据特征判断出是否应该有新的特征

7.多变量高斯分布

在这里插入图片描述
Tips：简单来说，本来CPU Load和Memory Use应该是线性关系，所以我们需要的数据模型p(x)应该是个椭圆，可是按照上述方法，p(x)所拟合的永远会是一种圆形，则无法判断出它的异常性

8.使用多变量高斯分布的异常检测

在这里插入图片描述

题目

1.Question 1

For which of the following problems would anomaly detection be a suitable algorithm?
在这里插入图片描述解答：AB

2.Question 2

Suppose you have trained an anomaly detection system for fraud detection, and your system that flags anomalies when p(x)p(x) is less than \varepsilonε, and you find on the cross-validation set that it is missing many fradulent transactions (i.e., failing to flag them as anomalies). What should you do?
在这里插入图片描述
解答：A

3.Question 3

Suppose you are developing an anomaly detection system to catch manufacturing defects in airplane engines. You model uses
在这里插入图片描述

解答：B

4.Question 4

Which of the following are true? Check all that apply.
在这里插入图片描述
解答：CD
（对于A，classfication accuracy在倾斜集上效果很差）

5.Question 5

You have a 1-D dataset {x⁽¹⁾…x^(m)}and you want to detect outliers in the dataset. You first plot the dataset and it looks like this:
在这里插入图片描述
Suppose you fit the gaussian distribution parameters μ₁ and σ₁² to this dataset. Which of the following values for μ₁ and σ₁² might you get?

解答：A

6.Question 6

Suppose you run a bookstore, and have ratings (1 to 5 stars) of books. Your collaborative filtering algorithm has learned a parameter vector θ ^(j)for user jj, and a feature vector x⁽ⁱ⁾ for each book. You would like to compute the “training error”, meaning the average squared error of your system’s predictions on all the ratings that you have gotten from your users. Which of these are correct ways of doing so (check all that apply)? For this problem, let mm be the total number of ratings you have gotten from your users. (Another way of saying this is
在这里插入图片描述
解答：AC

7.Question 7

In which of the following situations will a collaborative filtering system be the most appropriate learning algorithm (compared to linear or logistic regression)?
在这里插入图片描述
解答：CD
(对于B，用户只有一个，不用推荐系统）

8.Question 8

You run a movie empire, and want to build a movie recommendation system based on collaborative filtering. There were three popular review websites (which we’ll call A, B and C) which users to go to rate movies, and you have just acquired all three companies that run these websites. You’d like to merge the three companies’ datasets together to build a single/unified system. On website A, users rank a movie as having 1 through 5 stars. On website B, users rank on a scale of 1 - 10, and decimal values (e.g., 7.5) are allowed. On website C, the ratings are from 1 to 100. You also have enough information to identify users/movies on one website with users/movies on a different website. Which of the following statements is true?
在这里插入图片描述
解答：D

9.Question 9

Which of the following are true of collaborative filtering systems? Check all that apply.
在这里插入图片描述
解答：AB

10.Question 10

Suppose you have two matrices A and B, where A is 5x3 and B is 3x5. Their product is C = AB, a 5x5 matrix. Furthermore, you have a 5x5 matrix RR where every entry is 0 or 1. You want to find the sum of all elements C(i,j) for which the corresponding R(i,j) is 1, and ignore all elements C(i,j)where R(i,j) = 0. One way to do so is the following code:
Which of the following pieces of Octave code will also correctly compute this total? Check all that apply. Assume all options are in code.
在这里插入图片描述
解答：AB
（对于C，如果是A*B.*R就是对的）

未知丶丶

关注

1
点赞
踩
0

收藏

觉得还不错? 一键收藏
2
评论
Andrew Ng Machine Learning 第九周

Andrew Ng Machine Learning 第九周前言异常检测1.目标动机2.高斯分布（正态分布）3.算法4.开发和评估异常检测5.异常检测VS监督学习6.选择要使用的功能(1)特征高斯化(2)误差分析7.多变量高斯分布8.使用多变量高斯分布的异常检测题目1.Question 12.Question 23.Question 34.Question 45.Question 5前言网易云...
复制链接

扫一扫