15 Anomaly detection
15-1 Problem motivation
Anomaly detection example
Aircraft engine features:
$x_1$ = heat generated
$x_2$ = vibration intensity
$\cdots$
Dataset: $\{x^{(1)}, x^{(2)}, \cdots, x^{(m)}\}$
New engine: $x_{test}$
Density estimation
Dataset: $\{x^{(1)}, x^{(2)}, \cdots, x^{(m)}\}$
Is $x_{test}$ anomalous?
Example
Fraud detection:
$x^{(i)}$ = features of user $i$'s activities
Model $p(x)$ from data.
Identify unusual users by checking which have $p(x)<\epsilon$.
Manufacturing
Monitoring computers in a data center
$x^{(i)}$ = features of machine $i$
$x_1$ = memory use, $x_2$ = number of disk accesses/sec, $x_3$ = CPU load, $x_4$ = CPU load/network traffic
15-2 Gaussian distribution
Gaussian (Normal) distribution
Say $x\in\mathbb{R}$. If $x$ follows a Gaussian distribution with mean $\mu$ and variance $\sigma^2$:
$x \sim N(\mu,\sigma^2)$
$p(x;\mu,\sigma^2)=\frac{1}{\sqrt{2\pi}\,\sigma}\exp\left(-\frac{(x-\mu)^2}{2\sigma^2}\right)$
The larger $\sigma$ is, the wider and flatter the curve.
Parameter estimation
Dataset: $\{x^{(1)}, x^{(2)}, \cdots, x^{(m)}\}$, with $x^{(i)}\in\mathbb{R}$
Maximum-likelihood estimates: $\mu = \frac{1}{m}\sum_{i=1}^{m} x^{(i)}$, $\sigma^2 = \frac{1}{m}\sum_{i=1}^{m}(x^{(i)}-\mu)^2$
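A minimal numpy sketch of these maximum-likelihood estimates and the Gaussian density (the dataset here is synthetic, chosen only for illustration):

```python
import numpy as np

def fit_gaussian(x):
    """Maximum-likelihood estimates of mu and sigma^2 for a 1-D dataset."""
    mu = x.mean()
    sigma2 = ((x - mu) ** 2).mean()  # note: divides by m, not m - 1
    return mu, sigma2

def gaussian_pdf(x, mu, sigma2):
    """p(x; mu, sigma^2) for a univariate Gaussian."""
    return np.exp(-(x - mu) ** 2 / (2 * sigma2)) / np.sqrt(2 * np.pi * sigma2)

# Synthetic dataset (assumed, for illustration): true mu = 5, sigma^2 = 4.
rng = np.random.default_rng(0)
x = rng.normal(loc=5.0, scale=2.0, size=10_000)
mu, sigma2 = fit_gaussian(x)
```

With 10,000 samples the estimates land close to the true $\mu=5$ and $\sigma^2=4$.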
15-3 Algorithm
Density estimation
Training set: $x^{(1)},\cdots,x^{(m)}$
Each example is $x\in\mathbb{R}^n$
$p(x) = p(x_1;\mu_1,\sigma_1^2)\,p(x_2;\mu_2,\sigma_2^2)\cdots p(x_n;\mu_n,\sigma_n^2) = \prod_{j=1}^{n} p(x_j;\mu_j,\sigma_j^2)$
Anomaly detection algorithm
- Choose features $x_i$ that you think might be indicative of anomalous examples.
- Fit parameters $\mu_1,\cdots,\mu_n,\sigma_1^2,\cdots,\sigma_n^2$:
  $\mu_j = \frac{1}{m}\sum_{i=1}^{m} x_j^{(i)}$
  $\sigma_j^2 = \frac{1}{m}\sum_{i=1}^{m} (x_j^{(i)} - \mu_j)^2$
- Given a new example $x$, compute $p(x)$:
  $p(x) = \prod_{j=1}^{n} p(x_j;\mu_j,\sigma_j^2) = \prod_{j=1}^{n} \frac{1}{\sqrt{2\pi}\,\sigma_j}\exp\left(-\frac{(x_j-\mu_j)^2}{2\sigma_j^2}\right)$
- Flag an anomaly if $p(x)<\epsilon$.
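The steps above can be sketched in numpy (the training data, test points, and threshold here are synthetic assumptions, not from the lecture):

```python
import numpy as np

def fit_params(X):
    """Per-feature Gaussian parameters; X has shape (m, n)."""
    mu = X.mean(axis=0)
    sigma2 = X.var(axis=0)  # maximum-likelihood estimate (divides by m)
    return mu, sigma2

def p(X, mu, sigma2):
    """Product over features of the per-feature Gaussian densities."""
    d = np.exp(-(X - mu) ** 2 / (2 * sigma2)) / np.sqrt(2 * np.pi * sigma2)
    return d.prod(axis=1)

def detect(X, mu, sigma2, epsilon):
    """True where p(x) < epsilon, i.e. the example is flagged as an anomaly."""
    return p(X, mu, sigma2) < epsilon

# Synthetic training data (assumed, for illustration): two well-behaved features.
rng = np.random.default_rng(1)
X_train = rng.normal(size=(5000, 2))
mu, sigma2 = fit_params(X_train)

X_new = np.array([[0.0, 0.1],    # typical point
                  [8.0, -7.0]])  # far outside the training distribution
flags = detect(X_new, mu, sigma2, epsilon=1e-6)  # flags the second point only
```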
Anomaly detection example
15-4 Developing and evaluating an anomaly detection system
The importance of real-number evaluation
When developing a learning algorithm (choosing features, etc.), making decisions is much easier if we have a way of evaluating the algorithm with a single real number.
Assume we have some labeled data of anomalous and non-anomalous examples ($y=0$ if normal, $y=1$ if anomalous).
Training set: $x^{(1)}, x^{(2)}, \cdots, x^{(m)}$ (assume these are normal, non-anomalous examples)
Cross validation set: $(x_{cv}^{(1)}, y_{cv}^{(1)}), \cdots, (x_{cv}^{(m_{cv})}, y_{cv}^{(m_{cv})})$
Test set: $(x_{test}^{(1)}, y_{test}^{(1)}), \cdots, (x_{test}^{(m_{test})}, y_{test}^{(m_{test})})$
Aircraft engines motivation example
10000 good (normal) engines
20 flawed engines (anomalous)
Training set: 6000 good engines
CV: 2000 good engines ($y=0$), 10 anomalous ($y=1$)
Test: 2000 good engines ($y=0$), 10 anomalous ($y=1$)
(Alternatively, the same examples are sometimes reused for both the CV and test sets, but this is not recommended.)
Algorithm evaluation
Fit model $p(x)$ on the training set $\{x^{(1)},\cdots,x^{(m)}\}$
On a cross validation/test example $x$, predict
$y=\begin{cases}1 & \text{if } p(x)<\epsilon\ \text{(anomaly)}\\ 0 & \text{if } p(x)\ge\epsilon\ \text{(normal)}\end{cases}$
Possible evaluation metrics:
- True positive, false positive, false negative, true negative
- Precision/Recall
- $F_1$-score
Can also use the cross validation set to choose the parameter $\epsilon$.
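One way to pick $\epsilon$ on the CV set is to scan candidate thresholds and keep the one with the best $F_1$-score. A sketch (the `select_epsilon` helper and the tiny CV arrays below are illustrative assumptions):

```python
import numpy as np

def select_epsilon(p_cv, y_cv):
    """Scan candidate thresholds, keeping the one with the best F1 on the CV set.
    p_cv: model densities on CV examples; y_cv: 1 = anomaly, 0 = normal."""
    best_eps, best_f1 = 0.0, 0.0
    for eps in np.linspace(p_cv.min(), p_cv.max(), 1000):
        pred = (p_cv < eps).astype(int)
        tp = np.sum((pred == 1) & (y_cv == 1))
        fp = np.sum((pred == 1) & (y_cv == 0))
        fn = np.sum((pred == 0) & (y_cv == 1))
        if tp == 0:
            continue  # precision/recall zero or undefined; skip this threshold
        prec = tp / (tp + fp)
        rec = tp / (tp + fn)
        f1 = 2 * prec * rec / (prec + rec)
        if f1 > best_f1:
            best_eps, best_f1 = eps, f1
    return best_eps, best_f1

# Hypothetical CV densities: the two anomalies have much smaller p(x).
p_cv = np.array([1e-8, 2e-8, 0.3, 0.4, 0.5])
y_cv = np.array([1, 1, 0, 0, 0])
eps, f1 = select_epsilon(p_cv, y_cv)
```

Accuracy would be a poor metric here because the classes are heavily skewed, which is why $F_1$ (or precision/recall) is used instead.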
15-5 Anomaly detection vs. supervised learning
15-6 Choosing what features to use
Non-Gaussian features
If a feature's histogram is clearly non-Gaussian, a transformation such as $\log(x)$, $\log(x+c)$, or $x^{1/2}$ can make the data look a bit more Gaussian.
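A small sketch of why a log transform helps, using sample skewness as a rough symmetry check (the synthetic lognormal feature and the `skew` helper are assumptions for illustration):

```python
import numpy as np

rng = np.random.default_rng(2)
x = rng.lognormal(mean=0.0, sigma=1.0, size=10_000)  # right-skewed feature (synthetic)

def skew(v):
    """Sample skewness (Fisher definition, no bias correction); ~0 for symmetric data."""
    z = (v - v.mean()) / v.std()
    return (z ** 3).mean()

print(skew(x), skew(np.log(x)))  # the log-transformed feature is far closer to 0
```

In practice one would eyeball histograms of $x$ and of the candidate transforms rather than compute a statistic, but the idea is the same.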
Error analysis for anomaly detection
Want $p(x)$ large for normal examples $x$, and $p(x)$ small for anomalous examples.
Most common problem: $p(x)$ is comparable (say, both large) for normal and anomalous examples.
**Monitoring computers in a data center**
Choose features that might take on unusually large or small values in the event of an anomaly
$x_1$ = memory use of computer
$x_2$ = number of disk accesses/sec
$x_3$ = CPU load
$x_4$ = network traffic
A ratio feature such as $x_5 = \frac{\text{CPU load}}{\text{network traffic}}$ can take on an unusually large value for, say, a machine stuck in an infinite loop (high CPU load, little network traffic).
15-7 Multivariate Gaussian distribution
Motivating example: Monitoring machines in a data center
Multivariate Gaussian (Normal) distribution
$x\in\mathbb{R}^n$. Don't model $p(x_1), p(x_2), \cdots$, etc. separately. Model $p(x)$ all in one go.
Parameters: $\mu\in\mathbb{R}^n$, $\Sigma\in\mathbb{R}^{n\times n}$ (covariance matrix)
Multivariate Gaussian (Normal) examples
15-8 Anomaly detection using the multivariate Gaussian distribution
Multivariate Gaussian (Normal) distribution
Parameters $\mu, \Sigma$
$p(x;\mu,\Sigma)=\frac{1}{(2\pi)^{n/2}|\Sigma|^{1/2}}\exp\left(-\frac{1}{2}(x-\mu)^T\Sigma^{-1}(x-\mu)\right)$
Parameter fitting:
Given training set $\{x^{(1)}, x^{(2)}, \cdots, x^{(m)}\}$
$\mu = \frac{1}{m}\sum_{i=1}^{m} x^{(i)}$
$\Sigma = \frac{1}{m}\sum_{i=1}^{m} (x^{(i)}-\mu)(x^{(i)}-\mu)^T$
Anomaly detection with the multivariate Gaussian
- Fit model $p(x)$ by setting
  $\mu = \frac{1}{m}\sum_{i=1}^{m} x^{(i)}$
  $\Sigma = \frac{1}{m}\sum_{i=1}^{m} (x^{(i)}-\mu)(x^{(i)}-\mu)^T$
- Given a new example $x$, compute
  $p(x;\mu,\Sigma)=\frac{1}{(2\pi)^{n/2}|\Sigma|^{1/2}}\exp\left(-\frac{1}{2}(x-\mu)^T\Sigma^{-1}(x-\mu)\right)$
- Flag an anomaly if $p(x)<\epsilon$.
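A numpy sketch of the multivariate version (the correlated synthetic features and test points are assumptions for illustration):

```python
import numpy as np

def fit_multivariate(X):
    """ML estimates of mu and the full covariance matrix Sigma; X is (m, n)."""
    mu = X.mean(axis=0)
    Xc = X - mu
    Sigma = Xc.T @ Xc / X.shape[0]
    return mu, Sigma

def multivariate_pdf(X, mu, Sigma):
    """p(x; mu, Sigma) evaluated at each row of X."""
    n = mu.size
    Xc = X - mu
    Sinv = np.linalg.inv(Sigma)
    quad = np.sum(Xc @ Sinv * Xc, axis=1)  # (x - mu)^T Sigma^{-1} (x - mu) per row
    norm = (2 * np.pi) ** (n / 2) * np.sqrt(np.linalg.det(Sigma))
    return np.exp(-0.5 * quad) / norm

# Two strongly correlated features (synthetic, assumed data).
rng = np.random.default_rng(3)
x1 = rng.normal(size=5000)
X_train = np.column_stack([x1, x1 + 0.1 * rng.normal(size=5000)])
mu, Sigma = fit_multivariate(X_train)

# [2, 2] follows the correlation; [2, -2] is normal per-feature but violates it.
p_on, p_off = multivariate_pdf(np.array([[2.0, 2.0], [2.0, -2.0]]), mu, Sigma)
```

The per-feature model of 15-3 would assign these two test points nearly identical densities; capturing the off-diagonal covariance is exactly what lets the multivariate model flag the second one.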
Relationship to original model
Original model: $p(x) = p(x_1;\mu_1,\sigma_1^2)\times p(x_2;\mu_2,\sigma_2^2)\times\cdots\times p(x_n;\mu_n,\sigma_n^2)$
Corresponds to a multivariate Gaussian
$p(x;\mu,\Sigma)=\frac{1}{(2\pi)^{n/2}|\Sigma|^{1/2}}\exp\left(-\frac{1}{2}(x-\mu)^T\Sigma^{-1}(x-\mu)\right)$
where $\Sigma$ is diagonal: $\Sigma=\mathrm{diag}(\sigma_1^2,\cdots,\sigma_n^2)$, i.e. all off-diagonal covariances are zero.