Before training on a large dataset, plot a learning curve on a much smaller dataset to check whether the model has high bias; if it does, adding more data will not help.
Stochastic Gradient Descent
(taking linear regression as an example)
(repeat the outer loop 1 to 10 times over the training set)
As the picture shows, the parameters wander in random directions near the minimum and never converge to a single point, but the method is much faster than Batch Gradient Descent.
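A minimal sketch of the per-example update; the toy data, learning rate, and function names below are my own illustration, not from the notes:

```python
import numpy as np

def sgd_linear_regression(X, y, alpha=0.01, epochs=10):
    """Stochastic gradient descent for linear regression.

    Updates theta after *each* example, instead of after a full
    pass over the data as batch gradient descent would.
    """
    m, n = X.shape
    theta = np.zeros(n)
    rng = np.random.default_rng(0)
    for _ in range(epochs):                # typically 1 to 10 passes
        for i in rng.permutation(m):       # shuffle the data first
            error = X[i] @ theta - y[i]    # h(x_i) - y_i
            theta -= alpha * error * X[i]  # update from one example
    return theta

# toy data: y = 2 * x, with a bias column of ones
X = np.c_[np.ones(100), np.linspace(0, 1, 100)]
y = 2 * X[:, 1]
theta = sgd_linear_regression(X, y, alpha=0.1, epochs=10)
```

On this noise-free toy problem the exact solution is a fixed point of the update, so theta lands close to [0, 2]; on noisy data it would keep oscillating around the optimum instead.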
Mini-Batch Gradient Descent
b is often chosen from 2 to 100
Mini-Batch Gradient Descent is faster than Stochastic Gradient Descent only because of vectorization.
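A sketch of the mini-batch variant. The gradient over each batch of b examples is a single vectorized matrix product, which is where the speed-up over one-example-at-a-time updates comes from (names and toy data are mine):

```python
import numpy as np

def minibatch_gd(X, y, alpha=0.1, b=10, epochs=100):
    """Mini-batch gradient descent: update theta after every b examples.

    The batch gradient is one vectorized matrix product, so a good
    linear-algebra library makes this faster than looping over
    single examples as stochastic gradient descent does.
    """
    m, n = X.shape
    theta = np.zeros(n)
    for _ in range(epochs):
        for start in range(0, m, b):
            Xb, yb = X[start:start + b], y[start:start + b]
            grad = Xb.T @ (Xb @ theta - yb) / len(yb)  # vectorized over the batch
            theta -= alpha * grad
    return theta

# toy data: y = 2 * x, with a bias column of ones
X = np.c_[np.ones(100), np.linspace(0, 1, 100)]
y = 2 * X[:, 1]
theta = minibatch_gd(X, y)
```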
Checking for Convergence
(without periodically scanning the entire training set)
In plot 1, with a smaller α, the red line converges to a slightly better value, but more slowly.
In plots 2 and 3, averaging the cost over more examples makes the red line smoother, which helps in plot 3, where the raw curve is too noisy; the trade-off is that the plot responds to progress more slowly.
In plot 4, the cost is increasing (the algorithm is diverging), so we should probably use a smaller α instead.
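The check described above can be sketched as follows: record the cost on each example just before its update, and every `window` iterations plot the average of those recent costs (the window size, data, and names are my choice here):

```python
import numpy as np

def sgd_with_monitoring(X, y, alpha=0.05, window=1000):
    """SGD that records cost(theta, x_i, y_i) *before* each update,
    then averages the last `window` costs as a cheap convergence
    check -- no full pass over the training set is needed."""
    m, n = X.shape
    theta = np.zeros(n)
    recent_costs, averages = [], []
    for i in np.random.default_rng(0).permutation(m):
        error = X[i] @ theta - y[i]
        recent_costs.append(0.5 * error ** 2)  # cost on this example, pre-update
        theta -= alpha * error * X[i]
        if len(recent_costs) == window:
            averages.append(np.mean(recent_costs))  # one point on the plot
            recent_costs = []
    return theta, averages

# simulated data: y = 3 * x plus a little noise
rng = np.random.default_rng(1)
X = np.c_[np.ones(5000), rng.normal(size=5000)]
y = 3 * X[:, 1] + rng.normal(scale=0.1, size=5000)
theta, averages = sgd_with_monitoring(X, y)
# the averaged cost should trend downward over the run
```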
Learning Rate
To get closer to the optimum, slowly decrease α over time, e.g. α = const1 / (iterationNumber + const2).
However, choosing these two extra constants is hard work, so this is often not a good choice; a fixed α is usually kept instead.
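The decaying-schedule idea, as commonly written α = const1 / (iterationNumber + const2), fits in a one-line helper; the constant values below are arbitrary placeholders, not recommendations:

```python
def decaying_alpha(iteration, const1=1.0, const2=50.0):
    """Slowly shrink the learning rate so SGD settles nearer the
    optimum; const1 and const2 are the two constants that must be
    tuned by hand, which is the drawback of this schedule."""
    return const1 / (iteration + const2)

alphas = [decaying_alpha(t) for t in (0, 100, 1000)]
# alpha shrinks as the iteration number grows
```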
Online Learning
(learning from a continuous stream of data)
The examples carry no superscript (i): once a piece of data has been used for an update, it is discarded.
Online learning can also adapt to changing user preferences.
CTR (Click-Through Rate)
Showing the user 10 results yields 10 (x, y) pairs, i.e. 10 training examples with which to update the parameters.
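A sketch of online learning for predicted CTR, using logistic regression on a simulated stream; the price feature and the click model are invented purely for illustration:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def online_update(theta, x, y, alpha=0.1):
    """One online-learning step (logistic regression, e.g. for
    predicted CTR). Each (x, y) pair is used exactly once and then
    discarded: there is no fixed training set, hence no
    superscript (i) on the examples."""
    error = sigmoid(theta @ x) - y
    return theta - alpha * error * x

# hypothetical stream: x = [1, price]; users click cheap offers more often
rng = np.random.default_rng(0)
theta = np.zeros(2)
for _ in range(5000):
    price = rng.uniform(0, 1)
    x = np.array([1.0, price])
    y = float(rng.uniform() < sigmoid(2 - 4 * price))  # simulated click
    theta = online_update(theta, x, y)
```

After enough of the stream, theta learns a negative weight on price, so the model predicts a higher click probability for cheaper offers; if user preferences drifted, continued updates would track the change.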
MapReduce
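The MapReduce idea applied to batch gradient descent: split the summation in the gradient across shards of the data (the map step), then add the partial sums back together (the reduce step). This sketch runs the shards sequentially; in practice each shard would go to a different machine or core. All names here are mine:

```python
import numpy as np

def partial_gradient(X_shard, y_shard, theta):
    """Map step: one 'machine' sums the gradient terms over its shard."""
    return X_shard.T @ (X_shard @ theta - y_shard)

def mapreduce_gradient(X, y, theta, n_machines=4):
    """Reduce step: add the partial sums from every shard.

    Because the batch gradient is just a sum over examples, splitting
    that sum across machines gives the same answer as computing it
    centrally, up to communication cost.
    """
    shards = zip(np.array_split(X, n_machines),
                 np.array_split(y, n_machines))
    partials = [partial_gradient(Xs, ys, theta) for Xs, ys in shards]
    return sum(partials) / len(y)

# toy data: the sharded gradient must match the centralized one
X = np.c_[np.ones(100), np.linspace(0, 1, 100)]
y = 2 * X[:, 1]
theta = np.zeros(2)
grad = mapreduce_gradient(X, y, theta)
```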