Proofs of DFP and BFGS
Practical Methods of Optimization, 2nd Edition, Chapter 2
http://home.agh.edu.pl/~pba/pdfdoc/Numerical_Optimization.pdf
https://docs.scipy.org/doc/scipy/reference/generated/scipy.optimize.minimize.html
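Most references derive the DFP and BFGS update formulas by the method of undetermined coefficients; a proof from the optimization (variational) viewpoint is hard to find. A detailed proof is given here.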
Theorem 1 (DFP)
$$\min_{B} \|B - B_k\|_W$$
subject to: $B = B^\top$, $B s_k = y_k$
Here $\|A\|_W \equiv \|W^{1/2} A W^{1/2}\|_F$ with $W = G^{-1}$, where $G$ is the average Hessian, so that $W y_k = s_k$.
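As a numerical sanity check of the statement above, the sketch below applies the standard DFP update (the known minimizer of this problem, as given in Nocedal and Wright) and compares it against randomly generated feasible matrices. The helper names, the random test data, and the choice of $W$ as the inverse of a random symmetric positive definite stand-in for the average Hessian (so that $W y = s$) are assumptions made for illustration only.

```python
import numpy as np

def dfp_update(B, s, y):
    """Standard DFP update of the Hessian approximation B."""
    rho = 1.0 / (y @ s)
    V = np.eye(len(s)) - rho * np.outer(y, s)
    return V @ B @ V.T + rho * np.outer(y, y)

def spd_sqrt(W):
    """Symmetric square root of an SPD matrix via eigendecomposition."""
    vals, vecs = np.linalg.eigh(W)
    return (vecs * np.sqrt(vals)) @ vecs.T

def weighted_norm(A, W_half):
    """||A||_W = ||W^{1/2} A W^{1/2}||_F."""
    return np.linalg.norm(W_half @ A @ W_half, "fro")

def random_feasible(B_star, s, rng):
    """Another symmetric matrix with B s = y: add P M P, where P s = 0."""
    n = len(s)
    M = rng.standard_normal((n, n))
    M = M + M.T                                   # symmetric perturbation
    P = np.eye(n) - np.outer(s, s) / (s @ s)      # projector that annihilates s
    return B_star + P @ M @ P

rng = np.random.default_rng(0)
n = 5
A = rng.standard_normal((n, n))
G = A @ A.T + n * np.eye(n)            # hypothetical SPD stand-in for the average Hessian
s = rng.standard_normal(n)
y = G @ s                              # secant pair consistent with G, so y^T s > 0
W_half = spd_sqrt(np.linalg.inv(G))    # assumed W = G^{-1}, which gives W y = s

C = rng.standard_normal((n, n))
B_k = C @ C.T + np.eye(n)              # arbitrary SPD current approximation
B_dfp = dfp_update(B_k, s, y)

d_star = weighted_norm(B_dfp - B_k, W_half)
d_others = [weighted_norm(random_feasible(B_dfp, s, rng) - B_k, W_half)
            for _ in range(100)]

print("symmetric:", np.allclose(B_dfp, B_dfp.T))
print("secant equation holds:", np.allclose(B_dfp @ s, y))
print("DFP distance is smallest:", d_star <= min(d_others) + 1e-9)
```

Random sampling of the feasible set is only evidence, not a proof; it shows what the theorem claims before the argument below establishes it.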
Proof:
Let $\gamma = y_k$, $\delta = s_k$, $\xi = \gamma - B\delta$.
Lagrangian function:
L=14trace(WE⊤WE)+trace(