Mini-Batch Gradient Descent
1. What is Mini-Batch Gradient Descent?
Mini-Batch Gradient Descent is an algorithm that sits between Batch Gradient Descent and Stochastic Gradient Descent. Concretely, each iteration uses a subset of M examples (not one, and not all).
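The difference between the three variants is just how many examples feed each update. A minimal sketch (the variable names and batch size of 32 are my own choices, not from the text):

```python
import numpy as np

rng = np.random.default_rng(0)
N = 1000                      # total number of training examples
X = rng.normal(size=(N, 2))   # toy feature matrix

batch_gd_size = N        # Batch GD: all N examples per update
sgd_size = 1             # Stochastic GD: a single example per update
mini_batch_size = 32     # Mini-Batch GD: M examples, with 1 < M < N

# Draw one mini-batch for a single iteration
idx = rng.choice(N, size=mini_batch_size, replace=False)
mini_batch = X[idx]
print(mini_batch.shape)   # (32, 2)
```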
2. Compute Effort
The compute cost per iteration depends on the mini-batch size M: each update scales with the number of examples it touches, so it is not fixed. In the worst case (M = N), an iteration costs the same as one iteration of Batch Gradient Descent.
The table below shows the differences among these three Gradient Descent variants:

| Batch Gradient Descent | Mini-Batch Gradient Descent | Stochastic Gradient Descent |
|---|---|---|
| uses all examples in each iteration | uses M examples in each iteration | uses 1 example in each iteration |
| relatively compute-intensive per iteration | somewhere in between | relatively cheap per iteration |
3. Gradient Descent Formula
For every parameter θ_j:

$$\frac{\partial J(\theta)}{\partial \theta_j} = \frac{1}{M}\sum_{i=1}^{M}\left[h_\theta(x^{(i)}) - y^{(i)}\right] \cdot x_j^{(i)}$$
E.g., with two parameters θ_0, θ_1 and hypothesis $h_\theta(x) = \theta_0 + \theta_1 x_1$:

For j = 0 (where $x_0^{(i)} = 1$, the intercept term):

$$\frac{\partial J(\theta)}{\partial \theta_0} = \frac{1}{M}\sum_{i=1}^{M}\left[h_\theta(x^{(i)}) - y^{(i)}\right] \cdot x_0^{(i)}$$

For j = 1:

$$\frac{\partial J(\theta)}{\partial \theta_1} = \frac{1}{M}\sum_{i=1}^{M}\left[h_\theta(x^{(i)}) - y^{(i)}\right] \cdot x_1^{(i)}$$
Note that the dataset needs to be shuffled before each pass over the data (each epoch), so the mini-batches are not drawn in a fixed order.
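Putting the formulas together, here is a minimal sketch of mini-batch gradient descent for the two-parameter model $h_\theta(x) = \theta_0 + \theta_1 x_1$ above. The function name, learning rate, and batch size are my own choices for illustration:

```python
import numpy as np

def mini_batch_gd(x, y, M=32, lr=0.1, epochs=200, seed=0):
    """Fit theta0, theta1 for h_theta(x) = theta0 + theta1 * x1."""
    rng = np.random.default_rng(seed)
    theta = np.zeros(2)                        # [theta0, theta1]
    X = np.column_stack([np.ones_like(x), x])  # x_0 = 1 (intercept term)
    n = len(y)
    for _ in range(epochs):
        perm = rng.permutation(n)              # shuffle before each pass
        for start in range(0, n, M):
            batch = perm[start:start + M]
            Xb, yb = X[batch], y[batch]
            err = Xb @ theta - yb              # h_theta(x^(i)) - y^(i)
            grad = Xb.T @ err / len(batch)     # (1/M) * sum of err * x_j^(i)
            theta -= lr * grad
    return theta

# Toy data generated from y = 2 + 3*x plus a little noise
rng = np.random.default_rng(1)
x = rng.uniform(-1, 1, size=500)
y = 2 + 3 * x + rng.normal(scale=0.1, size=500)
theta = mini_batch_gd(x, y)
print(theta)  # should land close to [2, 3]
```

Each inner step computes the averaged gradient over one mini-batch, exactly the sum in the formulas above with M equal to the batch size.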