往期文章链接目录
文章目录
Consider an optimization problem in the standard form (we call this a primal problem):
We denote the optimal value of this as p ⋆ p^\star p⋆. We don’t assume the problem is convex.
The Lagrange dual function
We define the Lagrangian L L L associated with the problem as
L ( x , λ , v ) = f 0 ( x ) + ∑ i = 1 m λ i f i ( x ) + ∑ i = 1 p v i h i ( x ) L(x,\lambda, v) = f_0(x) + \sum^m_{i=1}\lambda_if_i(x) + \sum^p_{i=1}v_ih_i(x) L(x,λ,v)=f0(x)+i=1∑mλifi(x)+i=1∑pvihi(x)
We call vectors λ \lambda λ and v v v the dual variables or Lagrange multiplier vectors associated with the problem (1).
We define the Lagrange dual function (or just dual function) g g g as the minimum value of the Lagrangian over x x x: for λ ∈ R m , v ∈ R p \lambda \in \mathbf{R}^m, v\in\mathbf{R}^p λ∈Rm,v∈Rp,
g ( λ , v ) = i n f x ∈ D L ( x , λ , v ) = i n f x ∈ D ( f 0 ( x ) + ∑ i = 1 m λ i f i ( x ) + ∑ i = 1 p v i h i ( x ) ) g(\lambda,v) = \mathop{\rm inf}\limits_{x\in \mathcal{D}} L(x, \lambda, v) = \mathop{\rm inf}\limits_{x\in \mathcal{D}} \left( f_0(x) + \sum^m_{i=1}\lambda_if_i(x) + \sum^p_{i=1}v_ih_i(x)\right) g(λ,v)=x∈DinfL(x,λ,v)=x∈Dinf(f0(x)+i=1∑mλifi(x)+i=1∑pvihi(x))
Note that once we choose an x x x, f i ( x ) f_i(x) fi(x) and h i ( x ) h_i(x) hi(x) are fixed and therefore the dual function is a family of affine functions of ( λ \lambda λ, v v v), which is concave even the problem (1) is not convex.
Lower bound property
The dual function yields lower bounds on the optimal value p ⋆ p^\star p⋆ of the problem (1): For any λ ⪰ 0 \lambda \succeq 0 λ⪰0 and any v v v we have
g ( λ , v ) ≤ p ⋆ g(\lambda,v) \leq p^\star g(λ,v)≤p⋆
Suppose x ~ \tilde{x} x~ is a feasible point for the problem (1), i.e., f i ( x ~ ) ≤ 0 f_i(\tilde{x}) \leq 0 fi(x~)≤0 and h i ( x ~ ) = 0 h_i(\tilde{x}) = 0 hi(x~)=0, and λ ⪰ 0 \lambda \succeq 0 λ⪰0. Then we have
L ( x ~ , λ , v ) = f 0 ( x ~ ) + ∑ i =