Convex Optimization Reading Notes (2)

Article link: https://blog.csdn.net/qq_39337332/article/details/109312766

Chapter 3: Convex Functions

3.1 Basic properties and examples

3.1.1 Definition

A function $f:\mathbf{R}^n\rightarrow\mathbf{R}$ is convex if $\mathbf{dom}\space f$ is a convex set and, for all $x,y\in \mathbf{dom}\space f$ and $\theta\in[0,1]$ , we have
$f(\theta x+(1-\theta)y)\leq \theta f(x)+(1-\theta)f(y)$

$f$ is concave if $- f$ is convex.

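$f$ is convex if and only if, for all $x\in \mathbf{dom}\space f$ and all $v$ , the function $g (t) = f (x + t v)$ is convex on its domain $\{ t\mid x+tv\in \mathbf{dom}\space f \}$ . In other words, a function is convex exactly when its restriction to every line that intersects its domain is convex.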
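The defining inequality can be probed numerically by sampling random chords. The sketch below is only a sanity check under stated assumptions: the helper names (`convexity_gap`, `looks_convex`) and the two test functions are illustrative and not from the book, and passing the check is evidence of convexity, not a proof.

```python
import math
import random

def convexity_gap(f, x, y, theta):
    """Return theta*f(x) + (1-theta)*f(y) - f(theta*x + (1-theta)*y).

    For a convex f this gap is nonnegative for every theta in [0, 1].
    """
    z = [theta * xi + (1 - theta) * yi for xi, yi in zip(x, y)]
    return theta * f(x) + (1 - theta) * f(y) - f(z)

def looks_convex(f, dim, trials=2000, scale=5.0, tol=1e-9, seed=0):
    """Sample random chords; return False as soon as one violates the inequality."""
    rng = random.Random(seed)
    for _ in range(trials):
        x = [rng.uniform(-scale, scale) for _ in range(dim)]
        y = [rng.uniform(-scale, scale) for _ in range(dim)]
        theta = rng.random()
        if convexity_gap(f, x, y, theta) < -tol:
            return False
    return True

def squared_norm(x):
    """A convex function on R^n."""
    return sum(xi * xi for xi in x)

def sine_of_first(x):
    """A function on R that is neither convex nor concave."""
    return math.sin(x[0])

print(looks_convex(squared_norm, dim=3))
print(looks_convex(sine_of_first, dim=1))
```

The tolerance `tol` only absorbs floating-point rounding; it is not part of the definition.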

3.1.2 Extended-value extensions

If $f$ is convex, we define its extended-value extension $\tilde{f}:\mathbf{R}^n\rightarrow\mathbf{R}\cup \{\infty\}$ by
$\tilde{f}(x)=\left\{ \begin{array}{ll} f(x) & x \in \mathbf{dom}\space f \\ \infty & x \notin \mathbf{dom}\space f \end{array}\right.$

3.1.3 First-order conditions

Suppose $f$ is differentiable. Then $f$ is convex if and only if $\mathbf{dom}f$ is convex and
$f(y)\geq f(x)+\nabla f(x)^T(y-x)$
holds for all $x,y\in \mathbf{dom}f$ .
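The first-order condition says that the tangent plane at any point is a global underestimator of a convex function. As a minimal sketch, the code below checks this for log-sum-exp, whose gradient is the softmax vector; the function names and the sampling ranges are assumptions chosen for illustration.

```python
import math
import random

def log_sum_exp(x):
    m = max(x)  # shift by the maximum for numerical stability
    return m + math.log(sum(math.exp(xi - m) for xi in x))

def log_sum_exp_grad(x):
    """Gradient of log-sum-exp, i.e. the softmax of x."""
    m = max(x)
    w = [math.exp(xi - m) for xi in x]
    s = sum(w)
    return [wi / s for wi in w]

def first_order_gap(f, grad, x, y):
    """Return f(y) - (f(x) + grad(x)^T (y - x)); nonnegative when f is convex."""
    g = grad(x)
    linear = f(x) + sum(gi * (yi - xi) for gi, xi, yi in zip(g, x, y))
    return f(y) - linear

def min_first_order_gap(trials=1000, dim=4, seed=0):
    """Smallest gap over random pairs (x, y); should not be meaningfully negative."""
    rng = random.Random(seed)
    worst = float("inf")
    for _ in range(trials):
        x = [rng.uniform(-3, 3) for _ in range(dim)]
        y = [rng.uniform(-3, 3) for _ in range(dim)]
        worst = min(worst, first_order_gap(log_sum_exp, log_sum_exp_grad, x, y))
    return worst

print(min_first_order_gap())
```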

3.1.4 Second-order conditions

Suppose $f$ is twice differentiable. Then $f$ is convex if and only if $\mathbf{dom}f$ is convex and $\nabla^2f(x)\succeq0$ for all $x\in \mathbf{dom}f$ .
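As an illustration of the second-order condition, consider the quadratic-over-linear function $f(x,y)=x^2/y$ on $y>0$ (an example chosen here, not taken from the notes above). Its Hessian is $\frac{2}{y^3}\left[\begin{array}{cc} y^2 & -xy \\ -xy & x^2 \end{array}\right]$ , which is positive semidefinite. The sketch below checks this at random points with a hand-written $2\times2$ test; the helper names are hypothetical.

```python
import random

def hessian_quad_over_lin(x, y):
    """Hessian of f(x, y) = x**2 / y on y > 0, as (a, b, c) for [[a, b], [b, c]]."""
    return 2.0 / y, -2.0 * x / y**2, 2.0 * x**2 / y**3

def is_psd_2x2(a, b, c, tol=1e-9):
    """A symmetric 2x2 matrix is PSD iff both diagonal entries and the determinant are >= 0."""
    det = a * c - b * b
    # The determinant tolerance is relative, since det is exactly 0 in theory here.
    return a >= -tol and c >= -tol and det >= -tol * (1.0 + abs(a * c))

def hessian_psd_on_samples(trials=1000, seed=0):
    rng = random.Random(seed)
    return all(
        is_psd_2x2(*hessian_quad_over_lin(rng.uniform(-5, 5), rng.uniform(0.1, 5)))
        for _ in range(trials)
    )

print(hessian_psd_on_samples())
```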

3.1.5 Examples

Powers of absolute value. $|x|^p$ for $p\geq1$ is convex on $\mathbf{R}$ .

Negative entropy. $x\log x$ (either on $\mathbf{R}_{++}$ , or on $\mathbf{R}_+$ , defined as $0$ for $x = 0$ ) is convex.

Norms. Every norm on $\mathbf{R}^n$ is convex.

Log-sum-exp. The function $f(x) = \log(e^{x_1} +\cdots+e^{x_n})$ is convex on $\mathbf{R}^n$ .

Geometric mean. The geometric mean $f(x)=(\prod_{i=1}^{n}x_i)^{\frac{1}{n}}$ is concave on $\mathbf{dom}f=\mathbf{R}_{++}^n$ .

Log-determinant. The function $f(X) = \log \det X$ is concave on $\mathbf{dom}f=\mathbf{S}_{++}^n$ .
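Concavity of the log-determinant can be spot-checked on random $2\times2$ positive definite matrices, where the determinant has a closed form. This is a sketch only: the matrices are stored as triples `(a, b, c)` for $\left[\begin{array}{cc} a & b \\ b & c \end{array}\right]$ , and the construction $BB^T+0.1I$ is an assumption used to guarantee positive definiteness.

```python
import math
import random

def random_spd_2x2(rng):
    """Return (a, b, c) for the matrix [[a, b], [b, c]] = B B^T + 0.1 I."""
    p, q, r, s = (rng.uniform(-2, 2) for _ in range(4))
    return p * p + q * q + 0.1, p * r + q * s, r * r + s * s + 0.1

def log_det_2x2(m):
    a, b, c = m
    return math.log(a * c - b * b)

def concavity_gap(x, y, theta):
    """log det(theta X + (1-theta) Y) - [theta log det X + (1-theta) log det Y]."""
    z = tuple(theta * xi + (1 - theta) * yi for xi, yi in zip(x, y))
    return log_det_2x2(z) - (theta * log_det_2x2(x) + (1 - theta) * log_det_2x2(y))

def min_concavity_gap(trials=2000, seed=0):
    """Smallest gap over random pairs; nonnegative (up to rounding) for a concave function."""
    rng = random.Random(seed)
    return min(
        concavity_gap(random_spd_2x2(rng), random_spd_2x2(rng), rng.random())
        for _ in range(trials)
    )

print(min_concavity_gap())
```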

3.1.6 Sublevel sets

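The $\alpha$ -sublevel set of a function $f:\mathbf{R}^n\rightarrow\mathbf{R}$ is
$C_\alpha=\{ x\in \mathbf{dom}f \mid f(x)\leq \alpha \}$
Sublevel sets of a convex function are convex for every $\alpha$ . The converse is not true: a function can have all its sublevel sets convex without being convex.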

3.1.7 Epigraph

The epigraph of a function $f:\mathbf{R}^n\rightarrow\mathbf{R}$ is
$\mathbf{epi}f=\{ (x,t) \mid x\in \mathbf{dom}f,\ f(x)\leq t \}$
which is a subset of $\mathbf{R}^{n+1}$ . A function is convex if and only if its epigraph is a convex set.

3.1.8 Jensen’s inequality and extensions

The basic inequality can be extended to
$f(\theta_1x_1+\cdots+\theta_nx_n)\leq \theta_1f(x_1)+\cdots+\theta_nf(x_n)$
where $f$ is convex, $x_i \in \mathbf{dom}f$ , $\theta_i\geq0$ and $\sum_i\theta_i=1$ . For integrals, if $p(x)\geq0$ on $S\subseteq\mathbf{dom}f$ and $\int_Sp(x)\,dx=1$ , then
$f\left(\int_Sp(x)x\,dx\right)\leq \int_Sp(x)f(x)\,dx$
provided the integrals exist. In probabilistic terms, this is the expectation inequality
$f(\mathbb{E}(x))\leq\mathbb{E}(f(x))$
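For a discrete random variable the expectation inequality reduces to the finite form of Jensen's inequality. The sketch below evaluates the gap $\mathbb{E}(f(x))-f(\mathbb{E}(x))$ for two convex functions; the support points and probabilities are made-up illustrative data.

```python
import math

def jensen_gap(f, points, probs):
    """Return E[f(x)] - f(E[x]) for a discrete distribution on the real line."""
    assert abs(sum(probs) - 1.0) < 1e-12 and all(p >= 0 for p in probs)
    mean = sum(p * x for p, x in zip(probs, points))
    return sum(p * f(x) for p, x in zip(probs, points)) - f(mean)

def neg_log(x):
    """-log x, convex on x > 0."""
    return -math.log(x)

points = [0.5, 1.0, 2.0, 4.0]   # hypothetical support
probs = [0.1, 0.4, 0.3, 0.2]    # hypothetical probabilities

gap_exp = jensen_gap(math.exp, points, probs)
gap_neg_log = jensen_gap(neg_log, points, probs)
print(gap_exp, gap_neg_log)
```

With $f=-\log$ , a nonnegative gap is exactly the weighted arithmetic-geometric mean inequality.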

3.1.9 Inequalities

3.2 Operations that preserve convexity

3.2.1 Nonnegative weighted sums

If each $f_i$ is convex and $w_i\geq0$ , then
$\sum_iw_if_i(x)$
is convex.

3.2.2 Composition with an affine mapping

Suppose $f:\mathbf{R}^n\rightarrow\mathbf{R}, A\in \mathbf{R}^{n\times m},b\in \mathbf{R}^{n}$ , and define $g:\mathbf{R}^m\rightarrow\mathbf{R}$ by
$g(x)=f(Ax+b),\quad \mathbf{dom}\space g=\{ x\mid Ax+b\in \mathbf{dom}f \}$
If $f$ is convex, so is $g$ ; if $f$ is concave, so is $g$ .

3.2.3 Pointwise maximum and supremum

If $f_1$ and $f_2$ are convex functions then their pointwise maximum $f$ , defined by
$f(x)=\max\{ f_1(x),f_2(x) \}, \mathbf{dom}f=\mathbf{dom}f_1\cap\mathbf{dom}f_2$
is convex.

If, for each $y\in \mathcal{A}$ , $f(x,y)$ is convex in $x$ , then the pointwise supremum
$g(x)=\sup_{y\in \mathcal{A}}f(x,y)$
is convex, where $\mathbf{dom}\space g=\{ x\mid (x,y)\in \mathbf{dom} \space f \text{ for all } y\in \mathcal{A},\ \sup_{y\in \mathcal{A}}f(x,y) <\infty \}$ .
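A piecewise-linear function $\max_i(a_ix+b_i)$ is the simplest instance of the pointwise-maximum rule, since each affine piece is convex. The sketch below checks it on random chords and contrasts it with a pointwise minimum, which is not convex in general; the coefficients are arbitrary illustrative values.

```python
import random

def max_affine(pieces):
    """Return f(x) = max_i (a_i * x + b_i) for a list of (a_i, b_i) pairs."""
    return lambda x: max(a * x + b for a, b in pieces)

def min_chord_gap(f, trials=2000, lo=-10.0, hi=10.0, seed=0):
    """Smallest value of theta*f(x) + (1-theta)*f(y) - f(theta*x + (1-theta)*y)."""
    rng = random.Random(seed)
    worst = float("inf")
    for _ in range(trials):
        x, y, theta = rng.uniform(lo, hi), rng.uniform(lo, hi), rng.random()
        gap = theta * f(x) + (1 - theta) * f(y) - f(theta * x + (1 - theta) * y)
        worst = min(worst, gap)
    return worst

f_max = max_affine([(-2.0, 1.0), (0.0, -1.0), (0.5, 0.0), (3.0, -4.0)])

def f_min(x):
    """min(x, -x) = -|x|: a pointwise minimum of affine functions, which is concave."""
    return min(x, -x)

print(min_chord_gap(f_max))  # nonnegative up to rounding
print(min_chord_gap(f_min))  # negative: the convexity inequality fails
```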

3.2.4 Composition

3.2.5 Minimization

If $f$ is convex in $(x, y)$ , and $C$ is a convex nonempty set, then the function
$g(x)=\inf_{y\in C}f(x,y)$
is convex in $x$ , provided $g(x)>-\infty$ for all $x$ .
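A familiar instance of the minimization rule is the distance to a convex set: $f(x,y)=|x-y|$ is convex in $(x,y)$ , so $g(x)=\inf_{y\in C}|x-y|$ is convex. The sketch below takes $C=[a,b]$ on the real line (an assumption made for illustration), compares the closed form with a brute-force minimum over a grid of $y$ values, and samples chords of $g$ .

```python
import random

def dist_to_interval(x, a, b):
    """g(x) = inf over y in [a, b] of |x - y|, in closed form."""
    return max(a - x, 0.0, x - b)

def dist_by_grid(x, a, b, n=2001):
    """Approximate the same infimum by minimizing over n grid points in [a, b]."""
    return min(abs(x - (a + (b - a) * k / (n - 1))) for k in range(n))

def min_chord_gap(g, trials=2000, lo=-10.0, hi=10.0, seed=0):
    """Smallest value of theta*g(x) + (1-theta)*g(y) - g(theta*x + (1-theta)*y)."""
    rng = random.Random(seed)
    worst = float("inf")
    for _ in range(trials):
        x, y, theta = rng.uniform(lo, hi), rng.uniform(lo, hi), rng.random()
        gap = theta * g(x) + (1 - theta) * g(y) - g(theta * x + (1 - theta) * y)
        worst = min(worst, gap)
    return worst

def g(x):
    return dist_to_interval(x, -1.0, 2.0)

print(dist_to_interval(3.5, -1.0, 2.0), dist_by_grid(3.5, -1.0, 2.0))
print(min_chord_gap(g))
```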

3.2.6 Perspective of a function

If $f:\mathbf{R}^n\rightarrow\mathbf{R}$ , the perspective of $f$ is the function $g:\mathbf{R}^{n+1}\rightarrow\mathbf{R}$ defined by
$g (x, t) = t f (x / t)$
with domain
$\mathbf{dom}g=\{ (x,t)\mid x/t\in \mathbf{dom}f,t>0\}$
If $f$ is convex, then $g$ is convex; if $f$ is concave, then $g$ is concave.
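To make the perspective concrete, take the convex function $f(u)=-\log u$ on $u>0$ . Its perspective is $g(x,t)=t\log t-t\log x$ , which should be jointly convex in $(x,t)$ for $x,t>0$ . The sketch below builds the perspective generically for scalar $x$ and samples chords; the helper names and sampling box are assumptions.

```python
import math
import random

def perspective(f):
    """Return g(x, t) = t * f(x / t), defined for t > 0 (scalar x for simplicity)."""
    def g(x, t):
        if t <= 0:
            raise ValueError("the perspective is only defined for t > 0")
        return t * f(x / t)
    return g

def neg_log(u):
    """-log u, convex on u > 0."""
    return -math.log(u)

g = perspective(neg_log)   # g(x, t) = t*log(t) - t*log(x)

def min_chord_gap_2d(g, trials=2000, seed=0):
    """Smallest convexity gap of g over random chords in the box [0.1, 5]^2."""
    rng = random.Random(seed)
    worst = float("inf")
    for _ in range(trials):
        x1, t1 = rng.uniform(0.1, 5), rng.uniform(0.1, 5)
        x2, t2 = rng.uniform(0.1, 5), rng.uniform(0.1, 5)
        th = rng.random()
        mid = g(th * x1 + (1 - th) * x2, th * t1 + (1 - th) * t2)
        worst = min(worst, th * g(x1, t1) + (1 - th) * g(x2, t2) - mid)
    return worst

print(min_chord_gap_2d(g))
```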

3.3 The conjugate function

3.3.1 Definition and examples

Let $f:\mathbf{R}^n\rightarrow\mathbf{R}$ . The function $f^*:\mathbf{R}^n\rightarrow\mathbf{R}$ , defined as
$f^*(y) = \sup_{x\in \mathbf{dom}f}(y^Tx-f(x))$
is called the conjugate of $f$ . The domain of the conjugate function consists of the $y\in\mathbf{R}^n$ for which the supremum is finite. Since $f^*$ is a pointwise supremum of functions that are affine in $y$ , it is convex whether or not $f$ is.
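For a scalar function the supremum in the definition can be approximated by a maximum over a grid. The sketch below does this for $f(x)=x^2/2$ , whose conjugate is known to be $f^*(y)=y^2/2$ . The grid bounds and resolution are assumptions, and the approximation is only meaningful when the supremum is attained inside the grid.

```python
def conjugate_on_grid(f, y, lo=-10.0, hi=10.0, n=20001):
    """Approximate f*(y) = sup_x (y*x - f(x)) by a maximum over a grid in [lo, hi]."""
    step = (hi - lo) / (n - 1)
    grid = (lo + k * step for k in range(n))
    return max(y * x - f(x) for x in grid)

def half_square(x):
    """f(x) = x^2 / 2, which is its own conjugate."""
    return 0.5 * x * x

for y in (-2.0, 0.0, 1.5):
    # The grid value and the closed form y^2 / 2 should agree closely.
    print(y, conjugate_on_grid(half_square, y), 0.5 * y * y)
```

Because the grid maximum never exceeds the true supremum, it slightly underestimates $f^*(y)$ ; refine the grid if more accuracy is needed.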

3.3.2 Basic properties

Fenchel’s inequality

For all $x$ and $y$ , the definition of the conjugate gives
$f(x)+f^*(y)\geq x^Ty$
Conjugate of the conjugate

If $f$ is convex and closed (that is, $\mathbf{epi}f$ is a closed set), then the conjugate of its conjugate is the original function: $f^{**}=f$ .

Differentiable functions

Suppose $f$ is convex and differentiable, with $\mathbf{dom}f=\mathbf{R}^n$ . Let $z\in \mathbf{R}^n$ and $y=\nabla f(z)$ . Then
$f^*(y)=z^T\nabla f(z)-f(z)$
Scaling and composition with affine transformation

For $a>0$ and $b\in\mathbf{R}$ , the conjugate of $g (x) = a f (x) + b$ is $g^*(y)=af^*(y/a)-b$ .

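Suppose $A\in\mathbf{R}^{n\times n}$ is nonsingular and $b\in\mathbf{R}^n$ . The conjugate of $g (x) = f (A x + b)$ is $g^*(y)=f^*(A^{-T}y)-b^TA^{-T}y$ , with $\mathbf{dom}\space g^*=A^T\mathbf{dom}f^*$ .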
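The affine-composition rule $g^*(y)=f^*(A^{-T}y)-b^TA^{-T}y$ (note the transpose inside $f^*$ ) can be checked in two dimensions with $f(z)=\|z\|_2^2/2$ , for which $f^*(w)=\|w\|_2^2/2$ . The sketch below uses a non-symmetric $A$ so that $A^{-1}$ and $A^{-T}$ differ. The matrix, vector, and helper names are all illustrative assumptions. Since $g^*(y)$ is a supremum, no sampled $x$ should make $y^Tx-g(x)$ exceed the formula.

```python
import random

A = ((2.0, 1.0), (0.0, 1.0))   # nonsingular, non-symmetric (hypothetical data)
b = (0.5, -1.0)

def matvec(M, v):
    return (M[0][0] * v[0] + M[0][1] * v[1], M[1][0] * v[0] + M[1][1] * v[1])

def dot(u, v):
    return u[0] * v[0] + u[1] * v[1]

def inv_transpose(M):
    """Return M^{-T} for a 2x2 matrix M."""
    (p, q), (r, s) = M
    det = p * s - q * r
    # M^{-1} = (1/det) [[s, -q], [-r, p]], so M^{-T} = (1/det) [[s, -r], [-q, p]].
    return ((s / det, -r / det), (-q / det, p / det))

def g(x):
    """g(x) = f(Ax + b) with f(z) = ||z||^2 / 2."""
    z = matvec(A, x)
    z = (z[0] + b[0], z[1] + b[1])
    return 0.5 * dot(z, z)

def g_conjugate_formula(y):
    """f*(A^{-T} y) - b^T A^{-T} y, using f*(w) = ||w||^2 / 2."""
    w = matvec(inv_transpose(A), y)
    return 0.5 * dot(w, w) - dot(b, w)

def max_excess(y, trials=5000, seed=0):
    """Largest sampled y^T x - g(x) minus the formula value; should be <= 0."""
    rng = random.Random(seed)
    target = g_conjugate_formula(y)
    samples = ((rng.uniform(-5, 5), rng.uniform(-5, 5)) for _ in range(trials))
    return max(dot(y, x) - g(x) - target for x in samples)

y = (1.0, 2.0)
print(g_conjugate_formula(y), max_excess(y))
```

For this data the supremum is attained at $x=(-1.25,\,2.5)$ , where $y^Tx-g(x)=2.5$ matches the formula.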

Sums of independent functions

If $f(x,y)=f_1(x)+f_2(y)$ , where $f_1$ and $f_2$ are convex functions with conjugates $f_1^*$ and $f_2^*$ , then $f^*(u,v)=f_1^*(u)+f_2^*(v)$ .

3.4 Quasiconvex functions

3.4.1 Definition and examples

A function $f:\mathbf{R}^n\rightarrow\mathbf{R}$ is called quasiconvex (or unimodal) if its domain and all its sublevel sets
$S_{\alpha}=\{ x\in \mathbf{dom} f \mid f(x)\leq\alpha \}$
for $\alpha\in\mathbf{R}$ , are convex.

3.4.2 Basic properties

Quasiconvexity can be characterized by a variant of Jensen’s inequality: a function $f$ is quasiconvex if and only if $\mathbf{dom}f$ is convex and, for all $x,y\in \mathbf{dom}f$ and $0\leq\theta\leq1$ , $f(\theta x+(1-\theta)y)\leq\max\{f(x),f(y)\}$
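The function $\sqrt{|x|}$ on $\mathbf{R}$ is a handy test case: it is quasiconvex but not convex. The sketch below measures, over random chords, both the quasiconvexity gap $\max\{f(x),f(y)\}-f(\theta x+(1-\theta)y)$ and the ordinary convexity gap; the function choice and sampling range are assumptions made for illustration.

```python
import math
import random

def sqrt_abs(x):
    return math.sqrt(abs(x))

def worst_gaps(f, trials=2000, lo=-10.0, hi=10.0, seed=0):
    """Return (min quasiconvexity gap, min convexity gap) over random chords."""
    rng = random.Random(seed)
    quasi, convex = float("inf"), float("inf")
    for _ in range(trials):
        x, y, th = rng.uniform(lo, hi), rng.uniform(lo, hi), rng.random()
        fz = f(th * x + (1 - th) * y)
        quasi = min(quasi, max(f(x), f(y)) - fz)
        convex = min(convex, th * f(x) + (1 - th) * f(y) - fz)
    return quasi, convex

quasi_gap, convex_gap = worst_gaps(sqrt_abs)
print(quasi_gap)    # nonnegative up to rounding: the max inequality holds
print(convex_gap)   # clearly negative: the convexity inequality fails
```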

3.4.3 Differentiable quasiconvex functions

First-order conditions

Suppose $f:\mathbf{R}^n\rightarrow\mathbf{R}$ is differentiable. Then $f$ is quasiconvex if and only if $\mathbf{dom}f$ is convex and for all $x,y\in\mathbf{dom}f$
$f(y)\leq f(x) \Rightarrow\nabla f(x)^T(y-x)\leq0$
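As a quick illustration of the first-order condition, take $f(x)=-e^{-x^2}$ on $\mathbf{R}$ , which is quasiconvex (its sublevel sets are intervals centered at $0$ ) but not convex. The sketch below samples pairs with $f(y)\leq f(x)$ and records the largest value of $f'(x)(y-x)$ , which should not be positive. The example function is an assumption, not taken from the notes.

```python
import math
import random

def f(x):
    return -math.exp(-x * x)

def df(x):
    """Derivative of f."""
    return 2.0 * x * math.exp(-x * x)

def max_violation(trials=5000, lo=-3.0, hi=3.0, seed=0):
    """Largest f'(x)*(y - x) over sampled pairs with f(y) <= f(x); should be <= 0."""
    rng = random.Random(seed)
    worst = float("-inf")
    for _ in range(trials):
        x, y = rng.uniform(lo, hi), rng.uniform(lo, hi)
        if f(y) <= f(x):
            worst = max(worst, df(x) * (y - x))
    return worst

print(max_violation())
```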
Second-order conditions

Suppose $f$ is twice differentiable. If $f$ is quasiconvex, then for all $x\in\mathbf{dom}f$ and all $y\in\mathbf{R}^n$ , we have
$y^T\nabla f(x)=0\Rightarrow y^T \nabla ^2f(x)y\geq0$

3.4.4 Operations that preserve quasiconvexity

Nonnegative weighted maximum

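If $w_i\geq0$ and each $f_i$ is quasiconvex, then
$f(x)=\max\{w_1f_1(x),\ldots,w_mf_m(x) \}$
is quasiconvex. More generally,
$f(x)=\sup_{y\in C}( w(y)g(x,y) )$
is quasiconvex, where $w(y)\geq0$ and $g(x,y)$ is quasiconvex in $x$ for each $y$ .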

Minimization

If $f (x, y)$ is quasiconvex jointly in $x$ and $y$ and $C$ is a convex set, then the function
$\inf_{y\in C}f(x,y)$
is quasiconvex.

3.4.5 Representation via family of convex functions

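We seek a family of convex functions $\phi_t : \mathbf{R}^n\rightarrow\mathbf{R}$ , indexed by $t\in\mathbf{R}$ , with
$f(x)\leq t \Leftrightarrow\phi_t(x)\leq0$
that is, the $t$ -sublevel set of the quasiconvex function $f$ is the $0$ -sublevel set of the convex function $\phi_t$ .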
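A standard way to build such a family is for a ratio $f(x)=p(x)/q(x)$ with $p$ convex and nonnegative and $q$ concave and positive: then $\phi_t(x)=p(x)-tq(x)$ is convex for $t\geq0$ . The sketch below uses $p(x)=x^2+1$ and $q(x)=x+2$ on $x>-2$ (hypothetical choices) and counts sampled points where the two conditions disagree.

```python
import random

def p(x):
    return x * x + 1.0    # convex and nonnegative

def q(x):
    return x + 2.0        # affine (hence concave), positive on x > -2

def f(x):
    return p(x) / q(x)    # quasiconvex on x > -2

def phi(t, x):
    """phi_t(x) = p(x) - t*q(x); convex in x for every t >= 0."""
    return p(x) - t * q(x)

def count_disagreements(trials=5000, seed=0, margin=1e-9):
    """Count samples where 'f(x) <= t' and 'phi_t(x) <= 0' give different answers."""
    rng = random.Random(seed)
    bad = 0
    for _ in range(trials):
        x, t = rng.uniform(-1.9, 5.0), rng.uniform(0.0, 5.0)
        if abs(phi(t, x)) < margin:
            continue  # too close to the boundary to compare reliably in floating point
        bad += (f(x) <= t) != (phi(t, x) <= 0)
    return bad

print(count_disagreements())
```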

3.5 Log-concave and log-convex functions

3.5.1 Definition

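A function $f:\mathbf{R}^n\rightarrow\mathbf{R}$ is logarithmically concave or log-concave if $f(x) > 0$ for all $x\in\mathbf{dom}f$ and $\log f$ is concave. It is log-convex if $\log f$ is convex.

Equivalently, a function $f$ with convex domain and $f(x)>0$ is log-concave if and only if, for all $x,y\in\mathbf{dom}f$ and $0 \leq \theta \leq 1$ , we have
$f(\theta x+(1-\theta)y)\geq f(x)^{\theta}f(y)^{1-\theta}$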
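The Gaussian density is a classic log-concave function that is not itself concave. The sketch below samples the ratio $f(\theta x+(1-\theta)y)\,/\,\big(f(x)^{\theta}f(y)^{1-\theta}\big)$ , which is at least $1$ for a log-concave $f$ . The sampling range is an assumption; for the standard Gaussian the ratio equals $\exp(\theta(1-\theta)(x-y)^2/2)$ .

```python
import math
import random

def gaussian_pdf(x, mu=0.0, sigma=1.0):
    return math.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * math.sqrt(2 * math.pi))

def min_log_concavity_ratio(f, trials=2000, lo=-4.0, hi=4.0, seed=0):
    """Smallest sampled f(theta*x + (1-theta)*y) / (f(x)**theta * f(y)**(1-theta))."""
    rng = random.Random(seed)
    worst = float("inf")
    for _ in range(trials):
        x, y, th = rng.uniform(lo, hi), rng.uniform(lo, hi), rng.random()
        mid = f(th * x + (1 - th) * y)
        worst = min(worst, mid / (f(x) ** th * f(y) ** (1 - th)))
    return worst

print(min_log_concavity_ratio(gaussian_pdf))
```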

3.5.2 Properties

Twice differentiable log-convex/concave functions

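Suppose $f$ is twice differentiable, with $\mathbf{dom}f$ convex. Since $\nabla^2\log f(x)=\frac{1}{f(x)}\nabla^2f(x)-\frac{1}{f(x)^2}\nabla f(x)\nabla f(x)^T$ , we conclude that $f$ is log-convex if and only if for all $x\in\mathbf{dom} f$ ,
$f(x)\nabla^2f(x) \succeq \nabla f(x)\nabla f(x)^T$
and log-concave if and only if for all $x\in\mathbf{dom} f$ ,
$f(x)\nabla^2f(x) \preceq \nabla f(x)\nabla f(x)^T$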
Multiplication, addition, and integration

Log-convexity and log-concavity are closed under multiplication and positive scaling.

The sum of two log-convex functions is log-convex (in general, the sum of two log-concave functions is not log-concave). If $f (x, y)$ is log-convex in $x$ for every $y$ , then
$g(x)=\int f(x,y)dy$
is log-convex.

Integration of log-concave functions

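If $f:\mathbf{R}^n \times \mathbf{R}^m\rightarrow\mathbf{R}$ is log-concave, then $g(x)=\int f(x,y)\,dy$ is a log-concave function of $x$ on $\mathbf{R}^n$ . In particular, marginal distributions of log-concave probability densities are log-concave.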
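As a small numerical illustration, take $f(x,y)=\exp(-(x^2+xy+y^2))$ , which is log-concave because the quadratic form is positive definite. Completing the square gives the marginal $g(x)=\sqrt{\pi}\,e^{-3x^2/4}$ . The sketch below approximates $g$ with the trapezoid rule and checks log-concavity at one triple of points; the example function, integration limits, and step count are assumptions.

```python
import math

def f(x, y):
    """A log-concave function on R^2: exp of minus a positive definite quadratic."""
    return math.exp(-(x * x + x * y + y * y))

def marginal(x, lo=-10.0, hi=10.0, n=4000):
    """Trapezoid-rule approximation of g(x) = integral of f(x, y) dy over [lo, hi]."""
    h = (hi - lo) / n
    total = 0.5 * (f(x, lo) + f(x, hi))
    total += sum(f(x, lo + k * h) for k in range(1, n))
    return total * h

def log_concavity_gap(x1, x2, theta):
    """log g(mix) - [theta*log g(x1) + (1-theta)*log g(x2)]; >= 0 if g is log-concave."""
    mix = theta * x1 + (1 - theta) * x2
    return math.log(marginal(mix)) - (
        theta * math.log(marginal(x1)) + (1 - theta) * math.log(marginal(x2))
    )

print(marginal(1.0), math.sqrt(math.pi) * math.exp(-0.75))
print(log_concavity_gap(-1.0, 2.0, 0.3))
```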

3.6 Convexity with respect to generalized inequalities

3.6.1 Monotonicity with respect to a generalized inequality

Suppose $K\subseteq\mathbf{R}^n$ is a proper cone with associated generalized inequality $\preceq_K$ . A function $f:\mathbf{R}^n\rightarrow\mathbf{R}$ is called $K$ -nondecreasing if
$x\preceq_Ky \Longrightarrow f(x)\leq f(y)$
and $K$ -increasing if
$x\preceq_Ky,x\neq y \Longrightarrow f(x)< f(y)$

3.6.2 Convexity with respect to a generalized inequality

Suppose $K\subseteq\mathbf{R}^m$ is a proper cone with associated generalized inequality $\preceq_K$ . We say $f:\mathbf{R}^n\rightarrow\mathbf{R}^m$ is $K$ -convex if for all $x, y$ and $0 \leq \theta \leq 1$ ,
$f(\theta x+(1-\theta)y)\preceq_K \theta f(x)+(1-\theta)f(y)$
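For the simplest cone, $K=\mathbf{R}^m_+$ , the generalized inequality is componentwise, so $K$ -convexity just means that every component of $f$ is convex. The sketch below checks this for a hypothetical map $f(x)=(x^2,e^x)$ from $\mathbf{R}$ to $\mathbf{R}^2$ by testing whether $\theta f(x)+(1-\theta)f(y)-f(\theta x+(1-\theta)y)$ lies in $K$ .

```python
import math
import random

def f(x):
    """f: R -> R^2, with convex components x^2 and exp(x)."""
    return (x * x, math.exp(x))

def in_cone(v, tol=1e-9):
    """Membership in K = R^2_+ (the nonnegative orthant), up to a tolerance."""
    return all(vi >= -tol for vi in v)

def k_convex_on_samples(trials=2000, seed=0):
    """Check theta*f(x) + (1-theta)*f(y) - f(theta*x + (1-theta)*y) in K on random chords."""
    rng = random.Random(seed)
    for _ in range(trials):
        x, y, th = rng.uniform(-3, 3), rng.uniform(-3, 3), rng.random()
        fx, fy, fz = f(x), f(y), f(th * x + (1 - th) * y)
        gap = tuple(th * a + (1 - th) * b - c for a, b, c in zip(fx, fy, fz))
        if not in_cone(gap):
            return False
    return True

print(k_convex_on_samples())
```

For other cones, such as the positive semidefinite cone, only `in_cone` would need to change.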
Differentiable K-convex functions

A differentiable function $f$ is $K$ -convex if and only if its domain is convex, and for all $x,y\in\mathbf{dom}f$ ,
$f(y)\succeq_Kf(x)+Df(x)(y-x)$
Here $Df(x)\in\mathbf{R}^{m\times n}$ is the Jacobian matrix of $f$ at $x$ .