Introduction to nonlinear optimization: Chapter 5 Exercises

5.1. Find without MATLAB the Cholesky factorization of the matrix
$$\mathbf{A}=\begin{pmatrix} 1 & 2 & 4 & 7 \\ 2 & 13 & 23 & 38 \\ 4 & 23 & 77 & 122 \\ 7 & 38 & 122 & 294 \end{pmatrix}$$
Solution:

$$\mathbf{L}=\begin{pmatrix} l_{11}&0&0&0\\ l_{21}&l_{22}&0&0\\ l_{31}&l_{32}&l_{33}&0\\ l_{41}&l_{42}&l_{43}&l_{44} \end{pmatrix}$$
$$l_{11}=\sqrt{a_{11}}=1$$
$$\begin{pmatrix} l_{21}\\ l_{31}\\ l_{41} \end{pmatrix}=\frac{1}{\sqrt{1}}\begin{pmatrix} 2\\ 4\\ 7 \end{pmatrix}=\begin{pmatrix} 2\\ 4\\ 7 \end{pmatrix}$$
$$\mathbf{L}'=\begin{pmatrix} l_{22}&0&0\\ l_{32}&l_{33}&0\\ l_{42}&l_{43}&l_{44} \end{pmatrix}$$
$$\mathbf{L}'\mathbf{L}'^T=\begin{pmatrix} 13&23&38\\ 23&77&122\\ 38&122&294 \end{pmatrix}-\frac{1}{1}\begin{pmatrix} 2\\ 4\\ 7 \end{pmatrix}\begin{pmatrix} 2\\ 4\\ 7 \end{pmatrix}^T=\begin{pmatrix} 9&15&24\\ 15&61&94\\ 24&94&245 \end{pmatrix}$$
$$l_{22}=\sqrt{9}=3$$
$$\begin{pmatrix} l_{32}\\ l_{42} \end{pmatrix}=\frac{1}{\sqrt{9}}\begin{pmatrix} 15\\ 24 \end{pmatrix}=\begin{pmatrix} 5\\ 8 \end{pmatrix}$$
$$\mathbf{L}''=\begin{pmatrix} l_{33}&0\\ l_{43}&l_{44} \end{pmatrix}$$
$$\mathbf{L}''\mathbf{L}''^T=\begin{pmatrix} 61&94\\ 94&245 \end{pmatrix}-\frac{1}{9}\begin{pmatrix} 15\\ 24 \end{pmatrix}\begin{pmatrix} 15\\ 24 \end{pmatrix}^T=\begin{pmatrix} 36&54\\ 54&181 \end{pmatrix}$$
$$l_{33}=\sqrt{36}=6$$
$$l_{43}=\frac{54}{\sqrt{36}}=9$$
$$l_{44}^2=181-\frac{54\cdot 54}{36}=100\Rightarrow l_{44}=10$$
Therefore
$$\mathbf{L}=\begin{pmatrix} 1&0&0&0\\ 2&3&0&0\\ 4&5&6&0\\ 7&8&9&10 \end{pmatrix}$$
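The result can be double-checked in MATLAB (a minimal verification sketch, not part of the required hand computation):

A = [1 2 4 7; 2 13 23 38; 4 23 77 122; 7 38 122 294];
L = chol(A,'lower');        % lower-triangular Cholesky factor
disp(L);                    % should reproduce the matrix above
disp(norm(L*L' - A));       % should be numerically zero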
5.2. Consider the Freudenstein and Roth test function
$$f(\mathbf{x})=f_1(\mathbf{x})^2+f_2(\mathbf{x})^2,\quad \mathbf{x}\in\mathbb{R}^2$$
where
$$\begin{aligned} f_1(\mathbf{x})&=-13+x_1+\left(\left(5-x_2\right)x_2-2\right)x_2 \\ f_2(\mathbf{x})&=-29+x_1+\left(\left(x_2+1\right)x_2-14\right)x_2 \end{aligned}$$

(i) Show that the function f f f has three stationary points. Find them and prove
that one is a global minimizer, one is a strict local minimum and the third is
a saddle point.
(ii) Use MATLAB to employ the following three methods on the problem of minimizing f:

  1. the gradient method with backtracking and parameters $(s,\alpha,\beta)=(1,0.5,0.5)$.
  2. the hybrid Newton's method with parameters $(s,\alpha,\beta)=(1,0.5,0.5)$.
  3. damped Gauss-Newton's method with a backtracking line search strategy with parameters $(s,\alpha,\beta)=(1,0.5,0.5)$.

All the algorithms should use the stopping criterion $\|\nabla f(\mathbf{x})\|\le 10^{-5}$. Each
algorithm should be employed four times on the following four starting
points: $(-50,7)^T,(20,7)^T,(20,-18)^T,(5,-10)^T$. For each of the four starting points, compare the number of iterations and the point to which each
method converged. If a method did not converge, explain why.
Solution:
(i)

$$\frac{\partial f_1(\mathbf{x})}{\partial x_1}=\frac{\partial f_2(\mathbf{x})}{\partial x_1}=1$$
$$\frac{\partial f_1(\mathbf{x})}{\partial x_2}=-3x_2^2+10x_2-2$$
$$\frac{\partial f_2(\mathbf{x})}{\partial x_2}=3x_2^2+2x_2-14$$
$$\frac{\partial^2 f_1(\mathbf{x})}{\partial x_2^2}=-6x_2+10$$
$$\frac{\partial^2 f_2(\mathbf{x})}{\partial x_2^2}=6x_2+2$$
$$\frac{\partial f(\mathbf{x})}{\partial x_1}=2(f_1(\mathbf{x})+f_2(\mathbf{x}))$$
$$\frac{\partial f(\mathbf{x})}{\partial x_2}=2\left(f_1(\mathbf{x})\frac{\partial f_1(\mathbf{x})}{\partial x_2}+f_2(\mathbf{x})\frac{\partial f_2(\mathbf{x})}{\partial x_2}\right)$$
$$\nabla f(\mathbf{x})=\mathbf{0}\Rightarrow \frac{\partial f(\mathbf{x})}{\partial x_1}=\frac{\partial f(\mathbf{x})}{\partial x_2}=0$$
The first equation gives $f_2(\mathbf{x})=-f_1(\mathbf{x})$. Substituting this into the second, and noting $\frac{\partial f_1}{\partial x_2}-\frac{\partial f_2}{\partial x_2}=-6x_2^2+8x_2+12=-2(3x_2^2-4x_2-6)$,
$$\begin{aligned} \frac{\partial f(\mathbf{x})}{\partial x_2}&=2\left(f_1(\mathbf{x})\frac{\partial f_1(\mathbf{x})}{\partial x_2}+f_2(\mathbf{x})\frac{\partial f_2(\mathbf{x})}{\partial x_2}\right)\\ &=2f_1(\mathbf{x})\left(\frac{\partial f_1(\mathbf{x})}{\partial x_2}-\frac{\partial f_2(\mathbf{x})}{\partial x_2}\right)\\ &=-4f_1(\mathbf{x})(3x_2^2-4x_2-6)\\ &=0 \end{aligned}$$
$$\Rightarrow f_1(\mathbf{x})=f_2(\mathbf{x})=0 \quad\text{or}\quad \frac{\partial f_1(\mathbf{x})}{\partial x_2}=\frac{\partial f_2(\mathbf{x})}{\partial x_2}$$
1) If $f_1(\mathbf{x})=f_2(\mathbf{x})=0$:
$$f_1(\mathbf{x})-f_2(\mathbf{x})=16+(-2x_2^2+4x_2+12)x_2=-2(x_2-4)((x_2+1)^2+1)=0$$
$$\Rightarrow \begin{cases} x_1=5\\ x_2=4 \end{cases}$$
Since $f(\mathbf{x})\ge f((5,4)^T)=0$ for all $\mathbf{x}$, this stationary point is clearly the global minimizer.
2) If $\frac{\partial f_1(\mathbf{x})}{\partial x_2}=\frac{\partial f_2(\mathbf{x})}{\partial x_2}$:
$$\Rightarrow 3x_2^2-4x_2-6=0\Rightarrow x_2=\frac{2\pm\sqrt{22}}{3}$$
$$\begin{aligned} f_1(\mathbf{x})+f_2(\mathbf{x})&=0\\ -42+2x_1+(5x_2-x_2^2-2+x_2^2+x_2-14)x_2&=0\\ -42+2x_1+(6x_2-16)x_2&=0\\ -21+x_1+3x_2^2-8x_2&=0\\ -21+x_1+(4x_2+6)-8x_2&=0 \qquad (\text{since } 3x_2^2=4x_2+6)\\ x_1&=4x_2+15=\frac{53\pm 4\sqrt{22}}{3} \end{aligned}$$
To classify the Case 2 points, compute the Hessian there. First,
$$\frac{\partial^2 f(\mathbf{x})}{\partial x_1^2}=2\left(\frac{\partial f_1(\mathbf{x})}{\partial x_1}+\frac{\partial f_2(\mathbf{x})}{\partial x_1}\right)=4$$
At the Case 2 points, $\frac{\partial f_1}{\partial x_2}=\frac{\partial f_2}{\partial x_2}$ and $f_2(\mathbf{x})=-f_1(\mathbf{x})$, so
$$\frac{\partial^2 f(\mathbf{x})}{\partial x_1\partial x_2}=\frac{\partial^2 f(\mathbf{x})}{\partial x_2\partial x_1}=2\left(\frac{\partial f_1(\mathbf{x})}{\partial x_2}+\frac{\partial f_2(\mathbf{x})}{\partial x_2}\right)=4\frac{\partial f_1(\mathbf{x})}{\partial x_2}=2(12x_2-16)=8(3x_2-4)$$
and
$$\begin{aligned} \frac{\partial^2 f(\mathbf{x})}{\partial x_2^2}&=2\left(\left(\frac{\partial f_1(\mathbf{x})}{\partial x_2}\right)^2+f_1(\mathbf{x})\frac{\partial^2 f_1(\mathbf{x})}{\partial x_2^2}+\left(\frac{\partial f_2(\mathbf{x})}{\partial x_2}\right)^2+f_2(\mathbf{x})\frac{\partial^2 f_2(\mathbf{x})}{\partial x_2^2}\right)\\ &=2\left(2\left(\frac{\partial f_1(\mathbf{x})}{\partial x_2}\right)^2+f_1(\mathbf{x})\left(\frac{\partial^2 f_1(\mathbf{x})}{\partial x_2^2}-\frac{\partial^2 f_2(\mathbf{x})}{\partial x_2^2}\right)\right)\\ &=2\left(2\left(\frac{\partial f_1(\mathbf{x})}{\partial x_2}\right)^2+f_1(\mathbf{x})(-12x_2+8)\right)\\ &=4\left(\left(\frac{\partial f_1(\mathbf{x})}{\partial x_2}\right)^2-2f_1(\mathbf{x})(3x_2-2)\right) \end{aligned}$$
Hence
$$\det\nabla^2 f(\mathbf{x})=\begin{vmatrix} 4 & 4\frac{\partial f_1(\mathbf{x})}{\partial x_2}\\ 4\frac{\partial f_1(\mathbf{x})}{\partial x_2} & 4\left(\left(\frac{\partial f_1(\mathbf{x})}{\partial x_2}\right)^2-2f_1(\mathbf{x})(3x_2-2)\right) \end{vmatrix}=-32f_1(\mathbf{x})(3x_2-2)$$
Evaluating $f_1$ at the Case 2 points (repeatedly substituting $x_2^2=\frac{4x_2+6}{3}$):
$$\begin{aligned} f_1(\mathbf{x})&=-13+x_1+((5-x_2)x_2-2)x_2\\ &=-13+x_1+(5x_2-x_2^2-2)x_2\\ &=-13+x_1+\left(5x_2-\frac{4x_2+6}{3}-2\right)x_2\\ &=-13+x_1+\frac{1}{3}(11x_2-12)x_2\\ &=-13+x_1+\frac{1}{3}\left(11\cdot\frac{4x_2+6}{3}-12x_2\right)\\ &=-13+x_1+\frac{1}{9}(8x_2+66)\\ &=-\frac{17}{3}+x_1+\frac{8}{9}x_2\\ &=\frac{4}{27}(85\pm 11\sqrt{22})>0 \end{aligned}$$
Since $3x_2-2=\pm\sqrt{22}$ at these points,
$$\det\nabla^2 f(\mathbf{x})=\mp 32\sqrt{22}\,f_1(\mathbf{x})$$

At $\begin{cases} x_1=\frac{53+4\sqrt{22}}{3}\\ x_2=\frac{2+\sqrt{22}}{3} \end{cases}$: $\det\nabla^2 f(\mathbf{x})<0$, so the Hessian has one positive and one negative eigenvalue; $\nabla^2 f(\mathbf{x})$ is indefinite and this point is a saddle point.
At $\begin{cases} x_1=\frac{53-4\sqrt{22}}{3}\\ x_2=\frac{2-\sqrt{22}}{3} \end{cases}$: $\det\nabla^2 f(\mathbf{x})>0$ and the $(1,1)$ entry equals $4>0$, so $\nabla^2 f(\mathbf{x})\succ 0$ and this point is a strict local minimizer.
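The three stationary points can also be checked numerically (a minimal sketch; it assumes the data block from part (ii) below is saved as data.m, as the driver script there already does):

data;   % defines gradient, hessian, and the three stationary points
pts = {global_min_x, local_min_x, saddle_x};
for i = 1:3
    x = pts{i};
    fprintf('||grad|| = %e, eig(hessian) = %s\n', ...
        norm(gradient(x)), mat2str(eig(hessian(x))', 4));
end

Expected output: all gradient norms near zero; both Hessian eigenvalues positive at the two minimizers and of mixed sign at the saddle point.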

(ii)
Gradient method with backtracking

function [x,fun_val]=gradient_method_backtracking(f,g,x0,s,alpha,...
    beta,epsilon)
% Gradient method with backtracking stepsize rule
%
% INPUT
%=======================================
% f ......... objective function
% g ......... gradient of the objective function
% x0......... initial point
% s ......... initial choice of stepsize
% alpha ..... tolerance parameter for the stepsize selection
% beta ...... the constant in which the stepsize is multiplied
% at each backtracking step (0<beta<1)
% epsilon ... tolerance parameter for stopping rule
% OUTPUT
%=======================================
% x ......... optimal solution (up to a tolerance)
%             of min f(x)
% fun_val ... optimal function value
x=x0;
grad=g(x);
fun_val=f(x);
iter=0;
while (norm(grad)>epsilon)
    iter=iter+1;
    t=s;
    while (fun_val-f(x-t*grad)<alpha*t*norm(grad)^2)
        t=beta*t;
    end
    x=x-t*grad;
    fun_val=f(x);
    grad=g(x);
    fprintf('iter_number = %3d norm_grad = %2.6f fun_val = %2.6f \n',...
        iter,norm(grad),fun_val);
end

Hybrid Newton's method

function x=newton_hybrid(f,g,h,x0,alpha,beta,epsilon)
% Hybrid Newton’s method
%
% INPUT
%=======================================
% f ......... objective function
% g ......... gradient of the objective function
% h ......... hessian of the objective function
% x0......... initial point
% alpha ..... tolerance parameter for the stepsize selection strategy
% beta ...... the proportion in which the stepsize is multiplied
% at each backtracking step (0<beta<1)
% epsilon ... tolerance parameter for stopping rule
% OUTPUT
%=======================================
% x ......... optimal solution (up to a tolerance)
%             of min f(x)
x=x0;
gval=g(x);
hval=h(x);
[L,p]=chol(hval,'lower');
if (p==0)
    d=L'\(L\gval);
else
    d=gval;
end
iter=0;
while ((norm(gval)>epsilon)&&(iter<10000))
    iter=iter+1;
    t=1;
    while(f(x-t*d)>f(x)-alpha*t*gval'*d)
        t=beta*t;
    end
    x=x-t*d;
    fprintf('iter= %2d f(x)=%10.10f\n',iter,f(x))
    gval=g(x);
    hval=h(x);
    [L,p]=chol(hval,'lower');
    if (p==0)
        d=L'\(L\gval);
    else
        d=gval;
    end
end
if (iter==10000)
    fprintf('did not converge\n')
end

Damped Gauss-Newton method

function [x,fun_val]=damped_Gauss_Newton(g,grad,J,F,x0,s,alpha,...
beta,epsilon)
% Damped Gauss-Newton method with backtracking stepsize rule
%
% INPUT
%=======================================
% g ......... objective function
% grad ...... gradient of the objective function
% J ......... Jacobian matrix
% F ......... vector-valued function
% x0......... initial point
% s ......... initial choice of stepsize
% alpha ..... tolerance parameter for the stepsize selection
% beta ...... the constant in which the stepsize is multiplied
%             at each backtracking step (0<beta<1)
% epsilon ... tolerance parameter for stopping rule
% OUTPUT
%=======================================
% x ......... optimal solution (up to a tolerance)
%             of min f(x)
% fun_val ... optimal function value
x=x0;
J_val=J(x);
F_val=F(x);
d=(J_val'*J_val)\(J_val'*F_val);
fun_val=g(x);
gval=grad(x);
iter=0;
while (norm(gval)>epsilon&&(iter<10000))
    iter=iter+1;
    t=s;
    while (fun_val-g(x-t*d)<alpha*t*gval'*d)
        t=beta*t;
    end
    x=x-t*d;
    J_val=J(x);
    F_val=F(x);
    d=(J_val'*J_val)\(J_val'*F_val);
    fun_val=g(x);
    gval=grad(x);
    fprintf('iter_number = %3d norm_grad = %2.6f fun_val = %2.6f \n',...
        iter,norm(gval),fun_val);
end
if (iter==10000)
    fprintf('did not converge\n')
end

Data script (saved as data.m)

clear;
s=1;
alpha=0.5;
beta=0.5;
epsilon=1e-5;

f1=@(x)-13+x(1)+((5-x(2))*x(2)-2)*x(2);
f1_1=@(x)1;
f1_1_1=@(x)0;
f1_2=@(x)-3*x(2)^2+10*x(2)-2;
f1_2_2=@(x)-6*x(2)+10;

f2=@(x)-29+x(1)+((x(2)+1)*x(2)-14)*x(2);
f2_1=@(x)1;
f2_1_1=@(x)0;
f2_2=@(x)3*x(2)^2+2*x(2)-14;
f2_2_2=@(x)6*x(2)+2;

f=@(x)f1(x)^2+f2(x)^2;
f_1=@(x)2*f1(x)+2*f2(x);
f_1_1=@(x)2*f1_1(x)+2*f2_1(x);
f_1_2=@(x)2*f1_2(x)+2*f2_2(x);
f_2=@(x)2*f1(x)*f1_2(x)+2*f2(x)*f2_2(x);
f_2_1=@(x)2*f1_2(x)+2*f2_2(x);
f_2_2=@(x)2*(f1_2(x)^2+f1(x)*f1_2_2(x)+f2_2(x)^2+f2(x)*f2_2_2(x));
gradient=@(x)[f_1(x);f_2(x)];
hessian=@(x)[f_1_1(x),f_1_2(x);f_2_1(x),f_2_2(x)];


F=@(x)[f1(x);f2(x)];
J=@(x)[f1_1(x),f1_2(x);f2_1(x),f2_2(x)];

global_min_x=[5;4];
local_min_x=[(53-4*sqrt(22))/3;(2-sqrt(22))/3];
saddle_x=[(53+4*sqrt(22))/3;(2+sqrt(22))/3];

x1=[-50;7];
x2=[20;7];
x3=[20;-18];
x4=[5;-10];

Driver script

data;
gradient_method_backtracking(f,gradient,x1,s,alpha,beta,epsilon);
gradient_method_backtracking(f,gradient,x2,s,alpha,beta,epsilon);
gradient_method_backtracking(f,gradient,x3,s,alpha,beta,epsilon);
gradient_method_backtracking(f,gradient,x4,s,alpha,beta,epsilon);

newton_hybrid(f,gradient,hessian,x1,alpha,beta,epsilon);
newton_hybrid(f,gradient,hessian,x2,alpha,beta,epsilon);
newton_hybrid(f,gradient,hessian,x3,alpha,beta,epsilon);
newton_hybrid(f,gradient,hessian,x4,alpha,beta,epsilon);

damped_Gauss_Newton(f,gradient,J,F,x1,s,alpha,beta,epsilon);
damped_Gauss_Newton(f,gradient,J,F,x2,s,alpha,beta,epsilon);
damped_Gauss_Newton(f,gradient,J,F,x3,s,alpha,beta,epsilon);
damped_Gauss_Newton(f,gradient,J,F,x4,s,alpha,beta,epsilon);

1. Gradient method with backtracking:
From $x_0=(-50,7)^T$: converged after 2252 iterations to the global minimizer $(5,4)^T$.
From $x_0=(20,7)^T$: converged after 2447 iterations to the global minimizer $(5,4)^T$.
From $x_0=(20,-18)^T$: converged after 2472 iterations to the strict local minimizer $\left(\frac{53-4\sqrt{22}}{3},\frac{2-\sqrt{22}}{3}\right)^T$.
From $x_0=(5,-10)^T$: converged after 2123 iterations to the global minimizer $(5,4)^T$.
2. Hybrid Newton's method:
From $x_0=(-50,7)^T$: converged after 8 iterations to the global minimizer $(5,4)^T$.
From $x_0=(20,7)^T$: converged after 8 iterations to the global minimizer $(5,4)^T$.
From $x_0=(20,-18)^T$: converged after 16 iterations to the strict local minimizer $\left(\frac{53-4\sqrt{22}}{3},\frac{2-\sqrt{22}}{3}\right)^T$.
From $x_0=(5,-10)^T$: converged after 13 iterations to the strict local minimizer $\left(\frac{53-4\sqrt{22}}{3},\frac{2-\sqrt{22}}{3}\right)^T$.
3. Damped Gauss-Newton method:
From $x_0=(-50,7)^T$: converged after 29 iterations to the global minimizer $(5,4)^T$.
From $x_0=(20,7)^T$: converged after 29 iterations to the global minimizer $(5,4)^T$.
From $x_0=(20,-18)^T$: did not converge.
From $x_0=(5,-10)^T$: did not converge.
In the two failing runs, the backtracking line search drives the stepsize $t$ toward zero, so the iterates essentially stop moving and the gradient norm never drops below the tolerance; the instrumented sketch below illustrates this.
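The stepsize collapse can be observed by rerunning the Gauss-Newton iteration with the accepted stepsize logged (a hypothetical diagnostic, not part of the original runs; the extra guard t > 1e-14 only prevents an endless inner loop):

data;
x = x3;                                   % the failing starting point (20,-18)^T
for iter = 1:30
    J_val = J(x); F_val = F(x);
    d = (J_val'*J_val)\(J_val'*F_val);    % Gauss-Newton direction
    gval = gradient(x);
    t = s;
    while (f(x)-f(x-t*d) < alpha*t*gval'*d) && (t > 1e-14)
        t = beta*t;                       % backtracking
    end
    fprintf('iter = %2d  t = %e  ||grad|| = %e\n', iter, t, norm(gval));
    x = x - t*d;
end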

5.3. Let $f$ be a twice continuously differentiable function satisfying $L\mathbf{I}\succeq\nabla^2 f(\mathbf{x})\succeq m\mathbf{I}$ for some $L>m>0$, and let $\mathbf{x}^*$ be the unique minimizer of $f$ over $\mathbb{R}^n$.
(i) Show that
$$f(\mathbf{x})-f(\mathbf{x}^*)\ge \frac{m}{2}\|\mathbf{x}-\mathbf{x}^*\|^2$$
for any $\mathbf{x}\in\mathbb{R}^n$.
(ii) Let $\{\mathbf{x}_k\}_{k\ge 0}$ be the sequence generated by damped Newton's method with constant stepsize $t_k=\frac{m}{L}$. Show that
$$f(\mathbf{x}_k)-f(\mathbf{x}_{k+1})\ge \frac{m}{2L}\nabla f(\mathbf{x}_k)^T\left(\nabla^2 f(\mathbf{x}_k)\right)^{-1}\nabla f(\mathbf{x}_k)$$
(iii) Show that $\mathbf{x}_k\to\mathbf{x}^*$ as $k\to\infty$.

Solution:
(i) By the quadratic Taylor theorem, there is a point $\boldsymbol{\xi}$ on the segment between $\mathbf{x}^*$ and $\mathbf{x}$ such that
$$f(\mathbf{x})-f(\mathbf{x}^*)=\nabla f(\mathbf{x}^*)^T(\mathbf{x}-\mathbf{x}^*)+\frac{1}{2}(\mathbf{x}-\mathbf{x}^*)^T\nabla^2 f(\boldsymbol{\xi})(\mathbf{x}-\mathbf{x}^*)\ge\frac{m}{2}\|\mathbf{x}-\mathbf{x}^*\|^2,$$
since $\nabla f(\mathbf{x}^*)=\mathbf{0}$ and $\nabla^2 f(\boldsymbol{\xi})\succeq m\mathbf{I}$.
(ii) Since $L\mathbf{I}\succeq\nabla^2 f(\mathbf{x})$, we have $\|\nabla^2 f(\mathbf{x})\|\le L$, so $f\in C_L^{1,1}$. The damped Newton step with $t_k=\frac{m}{L}$ is $\mathbf{x}_{k+1}=\mathbf{x}_k-\frac{m}{L}\left(\nabla^2 f(\mathbf{x}_k)\right)^{-1}\nabla f(\mathbf{x}_k)$, so by the descent lemma,
$$\begin{aligned} f(\mathbf{x}_k)-f(\mathbf{x}_{k+1}) &\ge \nabla f(\mathbf{x}_k)^T(\mathbf{x}_k-\mathbf{x}_{k+1})-\frac{L}{2}\|\mathbf{x}_k-\mathbf{x}_{k+1}\|^2\\ &=\frac{m}{L}\nabla f(\mathbf{x}_k)^T\left(\nabla^2 f(\mathbf{x}_k)\right)^{-1}\nabla f(\mathbf{x}_k)-\frac{L}{2}\cdot\frac{m^2}{L^2}\nabla f(\mathbf{x}_k)^T\left(\nabla^2 f(\mathbf{x}_k)\right)^{-2}\nabla f(\mathbf{x}_k)\\ &\ge \frac{m}{L}\nabla f(\mathbf{x}_k)^T\left(\nabla^2 f(\mathbf{x}_k)\right)^{-1}\nabla f(\mathbf{x}_k)-\frac{m}{2L}\nabla f(\mathbf{x}_k)^T\left(\nabla^2 f(\mathbf{x}_k)\right)^{-1}\nabla f(\mathbf{x}_k)\\ &=\frac{m}{2L}\nabla f(\mathbf{x}_k)^T\left(\nabla^2 f(\mathbf{x}_k)\right)^{-1}\nabla f(\mathbf{x}_k), \end{aligned}$$
where the second inequality uses $\left(\nabla^2 f(\mathbf{x}_k)\right)^{-1}\preceq\frac{1}{m}\mathbf{I}$, hence $\left(\nabla^2 f(\mathbf{x}_k)\right)^{-2}\preceq\frac{1}{m}\left(\nabla^2 f(\mathbf{x}_k)\right)^{-1}$.
(iii) Since $\nabla^2 f(\mathbf{x}_k)\preceq L\mathbf{I}$ implies $\left(\nabla^2 f(\mathbf{x}_k)\right)^{-1}\succeq\frac{1}{L}\mathbf{I}$, part (ii) gives
$$f(\mathbf{x}_k)-f(\mathbf{x}_{k+1})\ge\frac{m}{2L^2}\|\nabla f(\mathbf{x}_k)\|^2.$$
The sequence $\{f(\mathbf{x}_k)\}$ is nonincreasing and bounded below by $f(\mathbf{x}^*)$, hence convergent, so the left-hand side tends to $0$ and therefore $\|\nabla f(\mathbf{x}_k)\|\to 0$. Moreover, by the same Taylor argument as in (i) applied around $\mathbf{x}_k$,
$$f(\mathbf{x}^*)\ge f(\mathbf{x}_k)+\nabla f(\mathbf{x}_k)^T(\mathbf{x}^*-\mathbf{x}_k)+\frac{m}{2}\|\mathbf{x}^*-\mathbf{x}_k\|^2\ge f(\mathbf{x}_k)-\frac{1}{2m}\|\nabla f(\mathbf{x}_k)\|^2,$$
where the last step bounds $\nabla f(\mathbf{x}_k)^T\mathbf{d}+\frac{m}{2}\|\mathbf{d}\|^2$ below by its minimal value $-\frac{1}{2m}\|\nabla f(\mathbf{x}_k)\|^2$. Hence $f(\mathbf{x}_k)-f(\mathbf{x}^*)\le\frac{1}{2m}\|\nabla f(\mathbf{x}_k)\|^2\to 0$, and by (i),
$$\|\mathbf{x}_k-\mathbf{x}^*\|^2\le\frac{2}{m}\left(f(\mathbf{x}_k)-f(\mathbf{x}^*)\right)\to 0,$$
so $\mathbf{x}_k\to\mathbf{x}^*$ as $k\to\infty$.
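The inequalities of (i) and (ii) can be sanity-checked numerically; a minimal MATLAB sketch on a hypothetical strongly convex quadratic $f(\mathbf{x})=\frac{1}{2}\mathbf{x}^T\mathbf{Q}\mathbf{x}$ (so $\mathbf{x}^*=\mathbf{0}$, and $m$, $L$ are the extreme eigenvalues of $\mathbf{Q}$):

rng(0);
n = 5;
B = randn(n); Q = B'*B + eye(n);          % symmetric positive definite
ev = eig(Q); m = min(ev); L = max(ev);    % so m*I <= Q <= L*I
f = @(x) 0.5*x'*Q*x; g = @(x) Q*x;        % objective and gradient; Hessian is Q
x = randn(n,1);
for k = 1:20
    x_new = x - (m/L)*(Q\g(x));           % damped Newton step with t = m/L
    assert(f(x)-f(x_new) >= (m/(2*L))*g(x)'*(Q\g(x)) - 1e-12);  % part (ii)
    assert(f(x) >= (m/2)*norm(x)^2 - 1e-12);                    % part (i), f(x*)=0
    x = x_new;
end
fprintf('all inequalities hold; ||x_k - x*|| = %e\n', norm(x));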
