Convex Optimization Reading Notes (7)

Chapter 8: Geometric problems

8.1 Projection on a set

The distance of a point $x_0 \in \mathbf{R}^n$ to a closed set $C \subseteq \mathbf{R}^n$, in the norm $\|\cdot\|$, is defined as

$$\mathbf{dist}(x_0, C) = \inf\{\|x_0 - x\| \mid x \in C\}.$$

We refer to any point $z \in C$ that is closest to $x_0$, i.e., satisfies $\|z - x_0\| = \mathbf{dist}(x_0, C)$, as a projection of $x_0$ on $C$.

8.1.1 Projecting a point on a convex set

If $C$ is convex, then we can compute the projection $P_C(x_0)$ and the distance $\mathbf{dist}(x_0, C)$ by solving the convex optimization problem

$$\begin{aligned} \text{minimize} \quad & \|x - x_0\| \\ \text{subject to} \quad & f_i(x) \leq 0, \quad i = 1, \ldots, m \\ & Ax = b. \end{aligned}$$
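For a concrete instance, here is a minimal sketch with `cvxpy` (the package is an assumption of this note, and the polyhedron $\{x \mid Gx \preceq h\}$ and point $x_0$ are made-up illustration data):

```python
import cvxpy as cp
import numpy as np

# Project x0 onto the polyhedron {x | G x <= h} in the Euclidean norm.
np.random.seed(0)
n = 3
G = np.random.randn(5, n)
h = np.ones(5)
x0 = np.array([2.0, 2.0, 2.0])

x = cp.Variable(n)
prob = cp.Problem(cp.Minimize(cp.norm(x - x0, 2)), [G @ x <= h])
prob.solve()

print("dist(x0, C) =", prob.value)  # optimal value is the distance
print("P_C(x0)     =", x.value)     # optimal point is the projection
```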

8.1.2 Separating a point and a convex set

If $P_C(x_0)$ denotes the Euclidean projection of $x_0$ on $C$, where $x_0 \notin C$, then the hyperplane

$$(P_C(x_0) - x_0)^T \left( x - \tfrac{1}{2}(x_0 + P_C(x_0)) \right) = 0$$

(strictly) separates $x_0$ from $C$.
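Continuing the sketch above (reusing `x` and `x0` from it), the separating hyperplane can be read off directly:

```python
# Hyperplane (P_C(x0) - x0)^T (x - (x0 + P_C(x0))/2) = 0, i.e. a^T x = c.
p = x.value               # Euclidean projection P_C(x0) from the snippet above
a = p - x0
c = a @ ((x0 + p) / 2)

# x0 lies strictly on the negative side whenever x0 is outside C:
print(a @ x0 - c)         # equals -||p - x0||^2 / 2 < 0
```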

In other norms, the clearest link between the projection problem and the separating hyperplane problem is via Lagrange duality.

8.1.3 Projection and separation via indicator and support functions

Define the indicator function $I_C$ and the support function $S_C$ of the set $C$:

$$S_C(x) = \sup_{y \in C} x^T y, \qquad I_C(x) = \begin{cases} 0, & x \in C \\ \infty, & x \notin C. \end{cases}$$

The problem of projecting $x_0$ on a closed convex set $C$ can be expressed compactly as

$$\begin{aligned} \text{minimize} \quad & \|x - x_0\| \\ \text{subject to} \quad & I_C(x) \leq 0. \end{aligned}$$

The dual function of this problem is

$$g(z, \lambda) = \begin{cases} z^T x_0 - S_C(z), & \|z\|_* \leq 1, \ \lambda \geq 0 \\ -\infty, & \text{otherwise}, \end{cases}$$

so we obtain the dual problem

$$\begin{aligned} \text{maximize} \quad & z^T x_0 - S_C(z) \\ \text{subject to} \quad & \|z\|_* \leq 1. \end{aligned}$$
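As a sanity check on this duality, take $C$ to be the unit box $[-1, 1]^n$, whose support function is $S_C(z) = \|z\|_1$, and the Euclidean norm, which is its own dual norm. A sketch under those assumptions:

```python
import cvxpy as cp
import numpy as np

n = 3
x0 = np.array([3.0, -2.0, 0.5])

# Primal: project x0 onto the unit box C = [-1, 1]^n.
x = cp.Variable(n)
primal = cp.Problem(cp.Minimize(cp.norm(x - x0, 2)),
                    [cp.norm(x, "inf") <= 1])
primal.solve()

# Dual: maximize z^T x0 - S_C(z) over ||z||_2 <= 1, with S_C(z) = ||z||_1.
z = cp.Variable(n)
dual = cp.Problem(cp.Maximize(z @ x0 - cp.norm(z, 1)),
                  [cp.norm(z, 2) <= 1])
dual.solve()

print(primal.value, dual.value)  # equal by strong duality (about sqrt(5) here)
```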

8.2 Distance between sets

The distance between two sets $C$ and $D$, in a norm $\|\cdot\|$, is defined as

$$\mathbf{dist}(C, D) = \inf\{\|x - y\| \mid x \in C, \ y \in D\}.$$

8.2.1 Computing the distance between convex sets

We can find $\mathbf{dist}(C, D)$ by solving the convex optimization problem

$$\begin{aligned} \text{minimize} \quad & \|x - y\| \\ \text{subject to} \quad & f_i(x) \leq 0, \quad i = 1, \ldots, m \\ & g_i(y) \leq 0, \quad i = 1, \ldots, p. \end{aligned}$$
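A minimal sketch with two axis-aligned boxes standing in for $C$ and $D$ (illustration data; any convex constraint sets would do):

```python
import cvxpy as cp
import numpy as np

# Distance between the boxes C = [0,1]^2 and D = [2,3] x [0,1].
x = cp.Variable(2)
y = cp.Variable(2)
constraints = [x >= 0, x <= 1,                       # x in C
               y >= np.array([2.0, 0.0]),            # y in D
               y <= np.array([3.0, 1.0])]
prob = cp.Problem(cp.Minimize(cp.norm(x - y, 2)), constraints)
prob.solve()
print("dist(C, D) =", prob.value)                    # 1.0 for these boxes
```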

8.2.2 Separating convex sets

First express the distance between convex sets in the equivalent form

$$\begin{aligned} \text{minimize} \quad & \|w\| \\ \text{subject to} \quad & f_i(x) \leq 0, \quad i = 1, \ldots, m \\ & g_i(y) \leq 0, \quad i = 1, \ldots, p \\ & x - y = w. \end{aligned}$$

The dual function is

$$\begin{aligned} g(\lambda, z, \mu) &= \inf_{x, y, w}\Big( \|w\| + \sum_{i=1}^m \lambda_i f_i(x) + \sum_{i=1}^p \mu_i g_i(y) + z^T(x - y - w) \Big) \\ &= \begin{cases} \inf_x \Big( \sum_{i=1}^m \lambda_i f_i(x) + z^T x \Big) + \inf_y \Big( \sum_{i=1}^p \mu_i g_i(y) - z^T y \Big), & \|z\|_* \leq 1 \\ -\infty, & \text{otherwise}, \end{cases} \end{aligned}$$

which results in the dual problem

$$\begin{aligned} \text{maximize} \quad & \inf_x \Big( \sum_{i=1}^m \lambda_i f_i(x) + z^T x \Big) + \inf_y \Big( \sum_{i=1}^p \mu_i g_i(y) - z^T y \Big) \\ \text{subject to} \quad & \|z\|_* \leq 1 \\ & \lambda \succeq 0, \quad \mu \succeq 0. \end{aligned}$$

8.2.3 Distance and separation via indicator and support functions

The problem of finding the distance between two convex sets can be posed as the convex problem

$$\begin{aligned} \text{minimize} \quad & \|x - y\| \\ \text{subject to} \quad & I_C(x) \leq 0 \\ & I_D(y) \leq 0. \end{aligned}$$

The dual of this problem is

$$\begin{aligned} \text{maximize} \quad & -S_C(-z) - S_D(z) \\ \text{subject to} \quad & \|z\|_* \leq 1. \end{aligned}$$
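For the two boxes used earlier the support functions have closed form, so the dual can be checked numerically: for a box $[l, u]$, $S(z) = u^T z_+ - l^T z_-$, where $z_+$ and $z_-$ are the positive and negative parts of $z$. A sketch under those assumptions (the helper `S_box` is hypothetical, not from the book):

```python
import cvxpy as cp
import numpy as np

def S_box(z, l, u):
    # Support function of the box [l, u]: sup_{l <= y <= u} z^T y.
    return u @ cp.pos(z) - l @ cp.pos(-z)

lC, uC = np.zeros(2), np.ones(2)                     # C = [0,1]^2
lD, uD = np.array([2.0, 0.0]), np.array([3.0, 1.0])  # D = [2,3] x [0,1]

z = cp.Variable(2)
dual = cp.Problem(cp.Maximize(-S_box(-z, lC, uC) - S_box(z, lD, uD)),
                  [cp.norm(z, 2) <= 1])
dual.solve()
print("dual optimal =", dual.value)                  # matches dist(C, D) = 1.0
```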

8.3 Euclidean distance and angle problems

Suppose $a_1, \ldots, a_n$ is a set of vectors in $\mathbf{R}^n$, which we assume (for now) have known Euclidean lengths

$$l_1 = \|a_1\|_2, \quad \ldots, \quad l_n = \|a_n\|_2.$$

We will refer to the set of vectors as a configuration, or, when they are independent, a basis.

8.3.1 Gram matrix and realizability

The Gram matrix of the vectors $\{a_1, \ldots, a_n\}$ is

$$G = A^T A, \qquad A = [a_1 \ \cdots \ a_n],$$

i.e., $G_{ij} = a_i^T a_j$ and $G_{ii} = l_i^2$. The distance between $a_i$ and $a_j$ is

$$d_{ij} = \|a_i - a_j\|_2 = (G_{ii} + G_{jj} - 2G_{ij})^{1/2}.$$

The correlation coefficient $\rho_{ij}$ between (nonzero) $a_i$ and $a_j$ is given by

$$\rho_{ij} = \frac{a_i^T a_j}{\|a_i\|_2 \|a_j\|_2} = \frac{G_{ij}}{\sqrt{G_{ii}} \sqrt{G_{jj}}},$$

and the angle $\theta_{ij}$ is $\theta_{ij} = \cos^{-1} \rho_{ij}$. A set of lengths, distances, and angles (or correlation coefficients) is realizable if and only if the associated Gram matrix $G$ is positive semidefinite and has diagonal elements $l_1^2, \ldots, l_n^2$.
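In code, the realizability test is just an eigenvalue check, and a realizing configuration can be recovered by factoring $G$; the lengths and angles below are made-up illustration data (`numpy` assumed):

```python
import numpy as np

# Build the Gram matrix G_ij = l_i * l_j * cos(theta_ij) and test
# positive semidefiniteness.
l = np.array([1.0, 2.0, 1.5])                    # desired lengths
theta = np.deg2rad([[0, 60, 45],
                    [60, 0, 30],
                    [45, 30, 0]])                # desired pairwise angles
G = np.outer(l, l) * np.cos(theta)

eigvals = np.linalg.eigvalsh(G)
realizable = eigvals.min() >= -1e-9              # PSD up to roundoff
print("eigenvalues:", eigvals, "realizable:", realizable)

if realizable:
    # Recover a configuration A with A^T A = G from the eigendecomposition.
    w, V = np.linalg.eigh(G)
    A = (V * np.sqrt(np.maximum(w, 0))).T        # columns are the vectors a_i
    print(np.allclose(A.T @ A, G))               # True
```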

8.3.2 Problems involving angles only

Suppose we care only about the angles (or correlation coefficients) between the vectors, and do not specify the lengths or distances. Then the Gram matrix takes the form

$$G = \mathbf{diag}(l)\, C\, \mathbf{diag}(l),$$

where $C_{ij} = \cos\theta_{ij}$ is the correlation matrix and $l$ is the (free) vector of lengths.

8.3.3 Euclidean distance problems

In a Euclidean distance problem, we are concerned only with the distances between the vectors, $d_{ij}$, and do not care about the lengths of the vectors or about the angles between them.
A Euclidean distance matrix is a matrix $D \in \mathbf{S}^n$ with nonnegative elements and zero diagonal that satisfies

$$G = (z\mathbf{1}^T + \mathbf{1}z^T - D)/2 \succeq 0 \quad \text{for some } z \succeq 0,$$

where $D_{ij} = d_{ij}^2$.
In summary, a matrix $D \in \mathbf{S}^n$ is a Euclidean distance matrix if and only if

$$D_{ii} = 0, \quad i = 1, \ldots, n, \qquad D_{ij} \geq 0, \quad i, j = 1, \ldots, n, \qquad \Big(I - \tfrac{1}{n}\mathbf{1}\mathbf{1}^T\Big) D \Big(I - \tfrac{1}{n}\mathbf{1}\mathbf{1}^T\Big) \preceq 0.$$
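A quick numerical check of this characterization, on squared distances generated from actual points (so $D$ is a Euclidean distance matrix by construction):

```python
import numpy as np

np.random.seed(1)
P = np.random.randn(4, 3)                               # 4 points in R^3
D = np.square(P[:, None, :] - P[None, :, :]).sum(-1)    # D_ij = ||p_i - p_j||^2

n = D.shape[0]
J = np.eye(n) - np.ones((n, n)) / n                     # I - (1/n) 1 1^T
M = J @ D @ J
print(np.linalg.eigvalsh(M).max() <= 1e-9)              # True: J D J <= 0
```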

8.4 Extremal volume ellipsoids

8.4.1 The Löwner-John ellipsoid

The minimum volume ellipsoid that contains a set $C$ is called the Löwner-John ellipsoid of the set $C$, and is denoted $\mathcal{E}_{\rm lj}$. Parametrize the ellipsoid as

$$\mathcal{E} = \{v \mid \|Av + b\|_2 \leq 1\},$$

where we can assume without loss of generality that $A \in \mathbf{S}^n_{++}$. Computing the minimum volume ellipsoid containing $C$ can then be expressed as

$$\begin{aligned} \text{minimize} \quad & \log\det A^{-1} \\ \text{subject to} \quad & \sup_{v \in C} \|Av + b\|_2 \leq 1. \end{aligned}$$
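When $C$ is a finite point set (equivalently, its convex hull), the sup turns into finitely many norm constraints and the problem becomes directly solvable; a sketch assuming `cvxpy` with a conic solver such as SCS installed, on made-up points:

```python
import cvxpy as cp
import numpy as np

# Minimum volume ellipsoid {v : ||A v + b||_2 <= 1} covering finitely many
# points (the Lowner-John ellipsoid of their convex hull).
np.random.seed(2)
pts = np.random.randn(10, 2)

A = cp.Variable((2, 2), PSD=True)
b = cp.Variable(2)
constraints = [cp.norm(A @ p + b, 2) <= 1 for p in pts]
prob = cp.Problem(cp.Minimize(-cp.log_det(A)), constraints)  # = log det A^{-1}
prob.solve()
print("A =", A.value, "\nb =", b.value)
```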

8.4.2 Maximum volume inscribed ellipsoid

Parametrize the ellipsoid as

$$\mathcal{E} = \{Bu + d \mid \|u\|_2 \leq 1\},$$

with $B \in \mathbf{S}^n_{++}$. We now consider the problem of finding the ellipsoid of maximum volume that lies inside a convex set $C$:

$$\begin{aligned} \text{maximize} \quad & \log\det B \\ \text{subject to} \quad & \sup_{\|u\|_2 \leq 1} I_C(Bu + d) \leq 0. \end{aligned}$$
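For a polyhedron $C = \{x \mid a_i^T x \leq b_i\}$, the inner sup can be evaluated in closed form: $\sup_{\|u\|_2 \leq 1} a_i^T(Bu + d) = \|Ba_i\|_2 + a_i^T d$, which gives the constraints in the sketch below (the polyhedron data is made up for illustration):

```python
import cvxpy as cp
import numpy as np

# Maximum volume ellipsoid {B u + d : ||u||_2 <= 1} inside {x : a_i^T x <= b_i},
# via the constraints ||B a_i||_2 + a_i^T d <= b_i.
Apoly = np.array([[1.0, 0.0], [-1.0, 0.0], [0.0, 1.0], [0.0, -1.0], [1.0, 1.0]])
bpoly = np.array([1.0, 1.0, 1.0, 1.0, 1.5])

B = cp.Variable((2, 2), PSD=True)
d = cp.Variable(2)
constraints = [cp.norm(B @ a, 2) + a @ d <= bi for a, bi in zip(Apoly, bpoly)]
prob = cp.Problem(cp.Maximize(cp.log_det(B)), constraints)
prob.solve()
print("center d =", d.value)   # d is the maximum volume ellipsoid center (8.5.2)
```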

8.4.3 Affine invariance of extremal volume ellipsoids

If $\mathcal{E}_{\rm lj}$ is the Löwner-John ellipsoid of $C$, and $T \in \mathbf{R}^{n \times n}$ is nonsingular, then the Löwner-John ellipsoid of $TC$ is $T\mathcal{E}_{\rm lj}$. A similar result holds for the maximum volume inscribed ellipsoid.

8.5 Centering

8.5.1 Chebyshev center

The depth of a point $x \in C$ is defined as

$$\mathbf{depth}(x, C) = \mathbf{dist}(x, \mathbf{R}^n \setminus C).$$

A Chebyshev center of the set $C$ is defined as any point of maximum depth in $C$:

$$x_{\rm cheb}(C) = \operatorname*{argmax}_x \, \mathbf{depth}(x, C) = \operatorname*{argmax}_x \, \mathbf{dist}(x, \mathbf{R}^n \setminus C).$$
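For a polyhedron $\{x \mid a_i^T x \leq b_i\}$ in the Euclidean norm, the Chebyshev center is the center of the largest inscribed ball, computable by an LP; a sketch (same illustration polyhedron as above):

```python
import cvxpy as cp
import numpy as np

# Chebyshev center: maximize r subject to a_i^T x + r ||a_i||_2 <= b_i.
Apoly = np.array([[1.0, 0.0], [-1.0, 0.0], [0.0, 1.0], [0.0, -1.0], [1.0, 1.0]])
bpoly = np.array([1.0, 1.0, 1.0, 1.0, 1.5])

x = cp.Variable(2)
r = cp.Variable()
constraints = [Apoly[i] @ x + r * np.linalg.norm(Apoly[i]) <= bpoly[i]
               for i in range(len(bpoly))]
prob = cp.Problem(cp.Maximize(r), constraints)
prob.solve()
print("x_cheb =", x.value, "depth =", r.value)
```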

8.5.2 Maximum volume ellipsoid center

As an extension of this idea, we define the maximum volume ellipsoid center of $C$, denoted $x_{\rm mve}$, as the center of the maximum volume ellipsoid that lies in $C$.

8.5.3 Analytic center of a set of inequalities

The analytic center $x_{\rm ac}$ of a set of convex inequalities and linear equalities

$$f_i(x) \leq 0, \quad i = 1, \ldots, m, \qquad Fx = g$$

is defined as an optimal point of

$$\begin{aligned} \text{minimize} \quad & -\sum_{i=1}^m \log(-f_i(x)) \\ \text{subject to} \quad & Fx = g. \end{aligned}$$
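For linear inequalities $a_i^T x \leq b_i$ and no equality constraints this is a smooth unconstrained convex problem; a `cvxpy` sketch on the same illustration polyhedron:

```python
import cvxpy as cp
import numpy as np

# Analytic center: minimize the log barrier -sum_i log(b_i - a_i^T x).
Apoly = np.array([[1.0, 0.0], [-1.0, 0.0], [0.0, 1.0], [0.0, -1.0], [1.0, 1.0]])
bpoly = np.array([1.0, 1.0, 1.0, 1.0, 1.5])

x = cp.Variable(2)
prob = cp.Problem(cp.Minimize(-cp.sum(cp.log(bpoly - Apoly @ x))))
prob.solve()
print("x_ac =", x.value)
```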

8.6 Classification

In pattern recognition and classification problems we are given two sets of points in $\mathbf{R}^n$, $\{x_1, \ldots, x_N\}$ and $\{y_1, \ldots, y_M\}$, and we seek a function $f : \mathbf{R}^n \to \mathbf{R}$ with

$$f(x_i) > 0, \quad i = 1, \ldots, N, \qquad f(y_i) < 0, \quad i = 1, \ldots, M.$$

If these inequalities hold, we say that $f$, or its 0-level set $\{x \mid f(x) = 0\}$, separates, classifies, or discriminates the two sets of points.

8.6.1 Linear discrimination

In linear discrimination, we seek an affine function $f(x) = a^T x - b$ that classifies the points:

$$a^T x_i - b > 0, \quad i = 1, \ldots, N, \qquad a^T y_i - b < 0, \quad i = 1, \ldots, M.$$

Since the strict inequalities are homogeneous in $a$ and $b$, they are feasible if and only if the set of nonstrict linear inequalities

$$a^T x_i - b \geq 1, \quad i = 1, \ldots, N, \qquad a^T y_i - b \leq -1, \quad i = 1, \ldots, M$$

is feasible.
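Feasibility of the normalized inequalities can be checked directly; the two point clouds below are synthetic illustration data:

```python
import cvxpy as cp
import numpy as np

# Linear discrimination as a feasibility problem in (a, b).
np.random.seed(3)
X = np.random.randn(20, 2) + np.array([2.0, 2.0])    # first point set
Y = np.random.randn(20, 2) - np.array([2.0, 2.0])    # second point set

a = cp.Variable(2)
b = cp.Variable()
constraints = [X @ a - b >= 1, Y @ a - b <= -1]
prob = cp.Problem(cp.Minimize(0), constraints)
prob.solve()
print(prob.status)       # "optimal" iff the two sets are linearly separable
print(a.value, b.value)
```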

8.6.2 Nonlinear discrimination

We can just as well seek a nonlinear function $f$, from a given subspace of functions, that is positive on one set and negative on the other:

$$f(x_i) > 0, \quad i = 1, \ldots, N, \qquad f(y_i) < 0, \quad i = 1, \ldots, M.$$

Provided $f$ is linear in its parameters, these conditions are (strict) linear inequalities in the parameters.
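For example, with quadratic $f(x) = x^T P x + q^T x + r$, the conditions are linear in the parameters $(P, q, r)$; a feasibility sketch on synthetic data (whether it reports `optimal` depends on the random draw actually being separable):

```python
import cvxpy as cp
import numpy as np

# Quadratic discrimination: f(x) = x^T P x + q^T x + r is linear in (P, q, r).
np.random.seed(4)
X = np.random.randn(30, 2)
X = 4.0 * X / np.linalg.norm(X, axis=1, keepdims=True)   # outer ring, radius 4
Y = np.random.randn(30, 2)                               # inner cluster

P = cp.Variable((2, 2), symmetric=True)
q = cp.Variable(2)
r = cp.Variable()

constraints = [cp.quad_form(xi, P) + q @ xi + r >= 1 for xi in X]
constraints += [cp.quad_form(yi, P) + q @ yi + r <= -1 for yi in Y]
prob = cp.Problem(cp.Minimize(0), constraints)
prob.solve()
print(prob.status)   # "optimal" iff a separating quadratic exists for this data
```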

8.7 Placement and location

We have $N$ points in $\mathbf{R}^2$ or $\mathbf{R}^3$, and a list of pairs of points that must be connected by links. The positions of some of the $N$ points are fixed; our task is to determine the positions of the remaining points, i.e., to place the remaining points. The goal is to minimize

$$\sum_{(i,j) \in \mathcal{A}} f_{ij}(x_i, x_j),$$

where $\mathcal{A}$ is the set of all links in the graph, and $f_{ij} : \mathbf{R}^k \times \mathbf{R}^k \to \mathbf{R}$ is a cost function associated with arc $(i, j)$.

8.7.1 Linear facility location problems

In the simplest version of the problem, the cost associated with arc $(i, j)$ is the distance between nodes $i$ and $j$: $f_{ij}(x_i, x_j) = \|x_i - x_j\|$, i.e., we minimize

$$\sum_{(i,j) \in \mathcal{A}} \|x_i - x_j\|.$$
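A small sketch with made-up fixed points and links, placing two free points to minimize total Euclidean link length:

```python
import cvxpy as cp
import numpy as np

# Three fixed points (ids 0-2), two free points (ids 3-4), four links.
fixed = {0: np.array([0.0, 0.0]),
         1: np.array([4.0, 0.0]),
         2: np.array([0.0, 4.0])}
free_ids = [3, 4]
links = [(0, 3), (1, 3), (2, 4), (3, 4)]

pos = {i: cp.Variable(2) for i in free_ids}
pos.update(fixed)                                 # fixed points enter as constants

cost = sum(cp.norm(pos[i] - pos[j], 2) for i, j in links)
prob = cp.Problem(cp.Minimize(cost))
prob.solve()
print({i: pos[i].value for i in free_ids})
```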

8.7.2 Placement constraints

We can impose a constraint that limits the points $x_1, \ldots, x_p$ (say) to lie in a bounding box with perimeter not exceeding $P_{\max}$, by adding the constraints

$$u \preceq x_i \preceq v, \quad i = 1, \ldots, p, \qquad 2 \cdot \mathbf{1}^T (v - u) \leq P_{\max},$$

where $u, v$ are additional variables.

8.7.3 Nonlinear facility location problems

More generally, we can associate with each arc a cost that is a nonlinear increasing function of the length:

$$\text{minimize} \quad \sum_{i<j} w_{ij}\, h(\|x_i - x_j\|),$$

where $h$ is an increasing (on $\mathbf{R}_+$) and convex function, and $w_{ij} \geq 0$.
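For instance, $h(z) = z^2$ gives a quadratic placement problem; a sketch reusing the variables from the previous snippet:

```python
# Quadratic placement: h(z) = z^2, i.e. minimize the sum of squared lengths.
cost_sq = sum(cp.sum_squares(pos[i] - pos[j]) for i, j in links)
cp.Problem(cp.Minimize(cost_sq)).solve()
print({i: pos[i].value for i in free_ids})
```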

8.7.4 Location problems with path constraints

8.8 Floor planning

A floor planning problem can be considered an extension of a placement problem in two ways:

  • The objects to be placed are rectangles or boxes aligned with the axes (as opposed to points), and must not overlap.
  • Each rectangle or box to be placed can be reconfigured, within some limits. For example, we might fix the area of each rectangle, but not the width and height separately.
In all floor planning problems, we require that the cells lie inside the bounding rectangle:

$$x_i \geq 0, \quad y_i \geq 0, \quad x_i + w_i \leq W, \quad y_i + h_i \leq H, \quad i = 1, \ldots, N.$$

We also require that the cells do not overlap, except possibly on their boundaries:

$$\mathbf{int}(C_i \cap C_j) = \emptyset \quad \text{for } i \neq j.$$

8.8.1 Relative positioning constraints

The idea of relative positioning constraints is to specify, for each pair of cells, one of the four possible relative positioning conditions: left, right, above, or below. One simple method to specify these constraints is to give two relations on $\{1, \ldots, N\}$: $\mathcal{L}$ (meaning 'left of') and $\mathcal{B}$ (meaning 'below'). We then impose the constraint that $C_i$ is to the left of $C_j$ if $(i, j) \in \mathcal{L}$, and $C_i$ is below $C_j$ if $(i, j) \in \mathcal{B}$. This yields the constraints

$$x_i + w_i \leq x_j \ \text{for} \ (i, j) \in \mathcal{L}, \qquad y_i + h_i \leq y_j \ \text{for} \ (i, j) \in \mathcal{B}.$$

Writing $\mathcal{H}$ and $\mathcal{V}$ for the horizontal and vertical relations, the same inequalities can be written as

$$x_i + w_i \leq x_j \ \text{for} \ (i, j) \in \mathcal{H}, \qquad y_i + h_i \leq y_j \ \text{for} \ (i, j) \in \mathcal{V}.$$

8.8.2 Floor planning via convex optimization

We impose the bounding box constraints and the relative positioning constraints, which are linear inequalities. Minimum area constraints $w_i h_i \geq A_i$ can be added in convex form as $h_i \geq A_i / w_i$, and a convex objective such as the bounding box perimeter $2(W + H)$ can then be minimized, as sketched below.
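A minimal sketch with three cells and made-up relations and areas (`cvxpy` assumed; indices are 0-based):

```python
import cvxpy as cp

# Floor planning: 3 cells, relations L = {(0,1)} ("cell 0 left of cell 1")
# and B = {(0,2), (1,2)} ("cells 0 and 1 below cell 2"), minimum areas A_i,
# minimizing the bounding box perimeter 2(W + H).
N, areas = 3, [1.0, 2.0, 1.5]
x, y = cp.Variable(N), cp.Variable(N)
w, h = cp.Variable(N, pos=True), cp.Variable(N, pos=True)
W, H = cp.Variable(), cp.Variable()

cons = [x >= 0, y >= 0, x + w <= W, y + h <= H]           # bounding box
cons += [x[0] + w[0] <= x[1]]                             # (0,1) in L
cons += [y[0] + h[0] <= y[2], y[1] + h[1] <= y[2]]        # (0,2), (1,2) in B
cons += [h[i] >= areas[i] * cp.inv_pos(w[i])              # w_i h_i >= A_i,
         for i in range(N)]                               # in convex form

prob = cp.Problem(cp.Minimize(2 * (W + H)), cons)
prob.solve()
print("W =", W.value, "H =", H.value)
```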

8.8.3 Floor planning via geometric programming

The floor planning problem can also be formulated as a geometric program in the variables $x_i, y_i, w_i, h_i, W, H$.
