Line Search Methods

Latest recommended article published on 2023-08-01 15:11:01

大眼呆萌君

Category: Optimization

Article link: https://blog.csdn.net/my_god2008/article/details/103535759

Key points

An intuitive understanding of the Armijo condition

Background: In gradient descent algorithms, the step size may be too large or too small, as shown in the figures below.
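
A minimal numerical sketch of the same effect, assuming a toy objective $f(x) = x^2$ and a hypothetical helper `gradient_descent` (neither is from the original post). With a fixed step size, each update multiplies $x$ by $(1 - 2\alpha)$, so a step that is too large makes $|x|$ grow and a step that is too small barely moves the iterate:

```python
def gradient_descent(grad, x0, alpha, num_steps):
    """Run fixed-step gradient descent and return the final iterate."""
    x = x0
    for _ in range(num_steps):
        x = x - alpha * grad(x)
    return x


def grad(x):
    # Gradient of the toy objective f(x) = x**2 (an assumption for illustration).
    return 2.0 * x


x_large = gradient_descent(grad, x0=1.0, alpha=1.1, num_steps=20)    # overshoots: |x| grows
x_small = gradient_descent(grad, x0=1.0, alpha=0.001, num_steps=20)  # barely moves
x_good = gradient_descent(grad, x0=1.0, alpha=0.4, num_steps=20)     # converges quickly

print(abs(x_large), abs(x_small), abs(x_good))
```

The three step sizes are arbitrary choices made only to show the three regimes.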

Backtracking line search

 - Initialization: alpha (=1), tau (decay rate, 0 < tau < 1)
 - while f(x^t + alpha p^t) ">" f(x^t)
	 alpha = tau*alpha
	end
 - Update x^{t+1} = x^t + alpha p^t

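Starting from a large $\alpha$ and shrinking it by $\alpha = \tau \alpha$ only as far as needed keeps the step length from becoming too small.
$\textcolor{blue}{f(x^t + \alpha p^t) \text{ ">" } f(x^t): \text{ prevents steps that are too long relative to the decrease in } f \text{; implemented via the Armijo condition}}$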

Wolfe conditions

Armijo condition

$f(x^t + \alpha^t p^t) \leq f(x^t) + \alpha^t c_1 \cdot [g^t]^T p^t,$
where $g^t = \nabla f(x^t)$ denotes the gradient of $f$ at $x^t$ and $p^t$ denotes the search direction, a descent direction with $[g^t]^T p^t < 0$ (remark: gradient descent $\Leftrightarrow p^t = -g^t$).
In practice, $c_1$ is chosen quite small, say $c_1=10^{-4}$ .
In the case that $p = -g$, the while-condition in the 2nd step of the pseudo-code (the Armijo condition is violated, so $\alpha$ keeps shrinking) can be written as follows:
$f(x - \alpha \nabla f(x)) > f(x) - c_1\alpha \|\nabla f(x) \|_2^2 .$
*B&V book (Boyd and Vandenberghe, Convex Optimization): typical values are $c_1 \in [0.01,0.3], \tau \in [0.1,0.8]$
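
The backtracking loop with the Armijo test can be sketched as follows. This is a minimal illustration, not the author's implementation: the function name `backtracking_armijo`, the cap `max_shrinks`, the default `tau=0.5`, and the toy quadratic $f(x) = \tfrac{1}{2}(x_1^2 + 10 x_2^2)$ are all assumptions made for the example.

```python
def backtracking_armijo(f, grad, x, p, alpha0=1.0, tau=0.5, c1=1e-4, max_shrinks=50):
    """Shrink alpha by tau until the Armijo condition holds along direction p."""
    fx = f(x)
    slope = sum(gi * pi for gi, pi in zip(grad(x), p))  # [g^t]^T p^t
    if slope >= 0:
        raise ValueError("p must be a descent direction (g^T p < 0)")
    alpha = alpha0
    for _ in range(max_shrinks):
        trial = [xi + alpha * pi for xi, pi in zip(x, p)]
        if f(trial) <= fx + c1 * alpha * slope:  # Armijo condition satisfied
            break
        alpha *= tau
    return alpha  # falls back to the last shrunken value if max_shrinks is exhausted


# Toy ill-conditioned quadratic (an assumption for illustration).
def f(x):
    return 0.5 * (x[0] ** 2 + 10.0 * x[1] ** 2)


def grad(x):
    return [x[0], 10.0 * x[1]]


x = [1.0, 1.0]
history = [f(x)]
for _ in range(100):
    g = grad(x)
    if sum(gi * gi for gi in g) < 1e-24:  # gradient is numerically zero
        break
    p = [-gi for gi in g]  # gradient descent direction p = -g
    alpha = backtracking_armijo(f, grad, x, p)
    x = [xi + alpha * pi for xi, pi in zip(x, p)]
    history.append(f(x))

print(history[0], history[-1])
```

Because every accepted step satisfies the Armijo condition, the recorded objective values in `history` never increase.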

$\textcolor{blue}{\text{Intuition}}$

Require the reduction in $f$ to be at least a fixed fraction $c_1$ of the reduction promised by the first-order Taylor approximation of $f$ at $x^t$.
Also known as the sufficient decrease condition: require the step $\alpha$ to decrease the objective function by a sufficient amount.
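
To make the Taylor-approximation reading explicit, here is a short derivation. The one-dimensional function $\phi(\alpha)$ is notation introduced only for this sketch:

$$
\begin{aligned}
\phi(\alpha) &:= f(x^t + \alpha p^t), \qquad \phi(0) = f(x^t), \qquad \phi'(0) = [g^t]^T p^t < 0, \\
\text{first-order model:}\quad & \phi(\alpha) \approx \phi(0) + \alpha\,\phi'(0)
  \;\Rightarrow\; \text{predicted reduction} = -\alpha\,\phi'(0) > 0, \\
\text{Armijo condition:}\quad & \phi(0) - \phi(\alpha) \;\geq\; c_1 \cdot \bigl(-\alpha\,\phi'(0)\bigr).
\end{aligned}
$$

The last line is the Armijo inequality rearranged: the actual reduction on the left must be at least the fraction $c_1$ of the predicted reduction on the right.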

Curvature condition

The curvature condition rules out unacceptably short steps.
$\nabla f(x^t + \alpha^t p^t)^T p^t \geq c_2 \nabla f(x^t)^T p^t,$
where $c_2 \in (c_1,1)$ .
$\textcolor{gray}{\text{The condition requires that the slope of } f \text{ along } p^t \text{ at the new point is at least } c_2 \text{ times the slope at } x^t.}$
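
As a small sketch of how the two Wolfe conditions complement each other, the check below evaluates both along a one-dimensional slice $\phi(\alpha) = f(x + \alpha p)$. The helper name `wolfe_conditions`, the value $c_2 = 0.9$, and the toy problem ($f(x) = x^2$ at $x = 1$ with $p = -2$) are assumptions chosen for illustration.

```python
def wolfe_conditions(phi, dphi, alpha, c1=1e-4, c2=0.9):
    """Return (armijo_ok, curvature_ok) for step alpha along a 1-D slice phi."""
    armijo_ok = phi(alpha) <= phi(0.0) + c1 * alpha * dphi(0.0)
    curvature_ok = dphi(alpha) >= c2 * dphi(0.0)
    return armijo_ok, curvature_ok


# Toy slice: f(x) = x**2 at x = 1 with p = -f'(1) = -2, so phi(alpha) = (1 - 2*alpha)**2.
def phi(alpha):
    return (1.0 - 2.0 * alpha) ** 2


def dphi(alpha):
    return -4.0 * (1.0 - 2.0 * alpha)


for alpha in (0.01, 0.5, 1.0):
    print(alpha, wolfe_conditions(phi, dphi, alpha))
```

In this toy case the very short step passes the Armijo test but fails the curvature test, while the overly long step fails the Armijo test, which is the division of labor described above.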

Image sources

step sizes: https://people.maths.ox.ac.uk/hauser/hauser_lecture2.pdf
Figs 3.3, 3.4: Numerical Optimization (Nocedal and Wright)

References:

https://people.maths.ox.ac.uk/hauser/hauser_lecture2.pdf
https://optimization.mccormick.northwestern.edu/index.php/Line_search_methods
Numerical Optimization (Nocedal and Wright)
