Andrew Ng 's machine learning lecture note (15)

FrostMonarch

于 2018-06-07 12:12:23 发布

阅读量297

点赞数

分类专栏： Andrew Ng 's Note

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.csdn.net/FrostMonarch/article/details/80386529

版权

Andrew Ng 's Note 专栏收录该内容

15 篇文章 0 订阅

订阅专栏

Anomaly detection

This algorithm can help us to realize that whether some data sets are abnormal. We should follow the steps:

(1)Choose the features that you think may relate to anomalous examples

(2)

we can separate the data set into 3 parts. 60% training set(all samples should be not anomalous ), 20% cross validation set (50% anomalous samples inside), 20% test set(the other 50% anomalous samples inside).

The traning set is used to compute P(x) above. The cross validation set is used to verify the modle. Because, our data is skewed (just small number of anomalous samples) , the cost should be F1 score.

Anomaly detection VS supervised learning

Because, supervised learning is quite similar to anomaly detection, it's necessary to know which should be used in different situations.

Anomaly detection should be used in the following 2 situations.

(1) The future anomalous sample has a quite different feature than the sample in the data set.

(2) Anomalous samples are very small amount(10 - 50 ).

After we choose our features and find that our anomalous point is near not anomalous points, we'd better choose one more feature.

Recommended content based system using machine learning

There're several steps we need to follow:

(1) Feature vector

(2) And we need to minimize the cost function

It's awesome that we can use linear regression to solve this problem.

Summary

Our goal is to minimize this cost function, When we are minimizing x, theta should be constant. While minimizing theta, x should be constant. Remember i refers to the movie while j refers to the user.

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
Andrew Ng 's machine learning lecture note (15)

Anomaly detectionThis algorithm can help us to realize that whether some data sets are abnormal. We should follow the steps:(1)Choose the features that you think may relate to anomalous examples(2)(3)
复制链接

扫一扫

专栏目录

博客等级

码龄9年

222
原创

27
点赞

82
收藏

10
粉丝

关注

私信

热门文章

分类专栏

最新评论

IEEE极限编程总结（UESTC）
i153ad: 大佬这个比赛您是怎么准备的呐请问需要哪方面的知识
IEEE极限编程总结（UESTC）
FrostMonarch: 我记得木有
IEEE极限编程总结（UESTC）
FOWng_lp: 所以这个比赛没有罚时嘛，可以随便交嘛
怎么运行github仓库的BlurGan
十一qx: 你好运行出现下面的问题 Traceback (most recent call last): File "C:/Users/PC/Desktop/DeblurGAN-modified/DeblurGAN-master/test.py", line 28, in <module> data_loader = CreateDataLoader(opt) File "C:\Users\PC\Desktop\DeblurGAN-modified\DeblurGAN-master\data\data_loader.py", line 4, in CreateDataLoader data_loader = CustomDatasetDataLoader(opt) File "C:\Users\PC\Desktop\DeblurGAN-modified\DeblurGAN-master\data\custom_dataset_data_loader.py", line 32, in __init__ self.dataset = CreateDataset(opt) File "C:\Users\PC\Desktop\DeblurGAN-modified\DeblurGAN-master\data\custom_dataset_data_loader.py", line 21, in CreateDataset dataset.initialize(opt) File "F:\anaconda2\envs\pytorch\lib\site-packages\torch\utils\data\dataset.py", line 83, in __getattr__ raise AttributeError AttributeError 这个怎么改啊
原地合并两个排序数组 O（1）空间复杂度，O(n)时间复杂度
FrostMonarch: 代码的第14行，重新维护使它有序。

最新文章

目录

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。