拉格朗日乘子法简述 - A Brief Tutorial of Using Lagrange Multipliers

最新推荐文章于 2023-10-25 10:51:42 发布

止于至玄

最新推荐文章于 2023-10-25 10:51:42 发布

阅读量1k

点赞数

分类专栏： Convex Optimization 文章标签：凸优化

本文链接：https://blog.csdn.net/philthinker/article/details/66473983

版权

Convex Optimization 专栏收录该内容

8 篇文章 3 订阅

订阅专栏

Lagrange Multipliers are used to solve the optimal value of multivariate functions under a group of constraints. By lagrange multipliers, we can convert an optimal problem with $d$ variables and $k$ constraints to one with $d+k$ variables without constraint.

Equality Constraint
Suppose $x\in \mathbb{R}^{d}$ , we would like to solve some optimal value $x^{*}$ s. t. $\min_{x}f(x)$ and $g(x)=0$ , i.e.
$min x f (x), s . t . g (x) = 0$ $\min_{x}f(x),\quad s.t. \quad g(x)=0$
For simplicity, we omit the geometric explanation of the optimal problem. Define the Lagrange multiplier $\lambda\neq 0$ s.t.
$\nabla f (x) + λ \nabla g (x) = 0$ $\nabla f(x)+\lambda\nabla g(x)=0$
Define the corresponding Lagrange funcntion as
$L (x, λ) = f (x) + λ g (x)$ $L(x,\lambda)= f(x)+\lambda g(x)$
So
$\nabla x L (x, λ) = \nabla f (x) + λ \nabla g (x) = 0$ $\nabla_{x} L(x,\lambda)=\nabla f(x)+\lambda\nabla g(x)=0$
$\nabla λ L (x, λ) = g (x) = 0$ $\nabla_{\lambda} L(x,\lambda)=g(x)=0$
Obviously, the original optimal problem is converted to the new optimal problem with no constraint.
Inequality Constraint
In the inequality constraint case, the optimal problem can be defined as
$min x f (x), s . t . g (x) \leq 0$ $\min_{x}f(x),\quad s.t. \quad g(x)\leq 0$
When $g(x)<0$ , the constraint $g(x)\leq 0$ makes no sense which means that we can take $\nabla f(x)=0$ to solve the optimal problem. In addition, when $g(x)=0$ , the inequality constraint degrades to the equality constraint.
In summary, the KKT (Karush-Kuhn-Tucker) conditions are always satisfied:
$⎧ ⎩ ⎨ ⎪ ⎪ g (x) \leq 0; λ \geq 0; λ g (x) = 0$ $\left\{\begin{aligned} &g(x) \leq 0;\\ &\lambda \geq 0;\\ &\lambda g(x)=0 \end{aligned} \right.$
With efficiency concerned, we show the solution of the optimal problem together with next problem. Go on.
Multi-constraints
Consider an optimal problem with $m$ equality constraints and $n$ inequality constraints. In addition there is a feasible region $\mathbb{D}\subset\mathbb{R}^{d}$ :
$min x s . t . f (x) h i (x) = 0, i = 1, 2, \dots, m, g j (x) \leq 0, j = 1, 2, \dots, n$ $\begin{aligned} \min_{x} & f(x)\\ s.t. & h_{i}(x)=0, i=1,2,\dots,m,\\ & g_{j}(x)\leq 0, j=1,2,\dots,n \end{aligned}$
Define the lagrange function as
$L (x, λ, μ) = f (x) + \sum i = 1 m λ i h i (x) + \sum j = 1 n μ j g j (x)$ $L(x,\lambda,\mu)=f(x)+\sum_{i=1}^{m}\lambda_{i}h_{i}(x)+\sum_{j=1}^{n}\mu_{j}g_{j}(x)$
Since there are inequality constraints, the KKT condition is followed:
$⎧ ⎩ ⎨ ⎪ ⎪ ⎪ ⎪ g j (x) \leq 0; μ j \geq 0; μ j g j (x) = 0.$ $\left\{ \begin{aligned} &g_{j}(x)\leq 0;\\ &\mu_{j}\geq 0;\\ &\mu_{j}g_{j}(x)=0. \end{aligned}\right.$
Solving the original optimal problem, also known as primal problem, can be achieved by solving the corresponding dual problem. Then the Lagrange dual function $\Gamma:\mathbb{R}^{m}\times\mathbb{R}^{n}\to \mathbb{R}$ is defined as
$Γ (λ, μ) = inf x \in D L (x, λ, μ) = inf x \in D ⎛ ⎝ f (x) + \sum i = 1 m λ i h i (x) + \sum j = 1 n μ j g j (x) ⎞ ⎠$ $\begin{aligned}\Gamma(\lambda,\mu) &= \inf_{x\in \mathbb{D}}L(x,\lambda,\mu)\\ & = \inf_{x\in \mathbb{D}}\left(f(x)+\sum_{i=1}^{m}\lambda_{i}h_{i}(x)+\sum_{j=1}^{n}\mu_{j}g_{j}(x)\right) \end{aligned}$
Evidently, $\sum_{i=1}^{m}\lambda_{i}h_{i}(x)+\sum_{j=1}^{n}\mu_{j}g_{j}(x)\leq 0$ . Let $\tilde{x}\in\mathbb{D}$ , then
$Γ (λ, μ) = inf x \in D L (x, λ, μ) \leq L (x ~, λ, μ) \leq f (x ~)$ $\Gamma(\lambda,\mu)= \inf_{x\in \mathbb{D}}L(x,\lambda,\mu)\leq L(\tilde{x},\lambda,\mu)\leq f(\tilde{x})$ That is to say, the dual function shows the lower bound of the value of the primal problem. Then the dual problem is given by
$max λ, μ Γ (λ, μ) s . t . μ \geq 0$ $\max_{\lambda,\mu}\Gamma(\lambda,\mu)\quad s.t. \quad \mu\geq 0$
The dual problem is always a convex optimal problem, regardless of the convexity of the primal problem.

Let $p^{*}$ be the optimal value of the primal problem, $d^{*}$ be the optimal value of the dual problem. It has been shown that $d^{*}\leq p^{*}$ which is also known as weak duality. If $d^{*}= p^{*}$ , then strong duality holds. Generally, the strong duality does not always hold. However, when the primal problem is a convex optimal problem, i.e. $f(x),g_{j}(x)$ are convex functions, $h_{i}(x)$ are affine functions and there exists a least one $\tilde{x}$ making the inequality constraints strictly hold, the strong duality holds. In this case, take the derivative of the Lagrange function over $x$ , $\lambda$ and $\mu$ and set it to be zero. Then we can solve the dual problem as well as the primal problem.