Singular Value Decomposition: Weyl's Inequality and Its Variants

wzf@robotics_notes

Last modified: 2023-11-07 23:07:55


Category: Mathematical Foundations. Tags: matrices, linear algebra, robotics

First published: 2023-11-07 20:38:09

Article link: https://blog.csdn.net/woyaomaishu2/article/details/134275617


Contents

Introduction
I. Subspace Intersection
- 1. Subspace intersection lemma
- 2. Subspace intersection theorem
II. Weyl's Inequality for Eigenvalues
III. Weyl's Inequality for Singular Values
Summary
References

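Introduction

Weyl's inequality is a standard tool in perturbation analysis. This post collects Weyl's inequality (also called Weyl's theorem) in preparation for the proof of low-rank approximation via singular values under the Frobenius norm.

Two variants of Weyl's inequality are covered:

- the eigenvalue form

- the singular value form

Before turning to Weyl's inequality itself, we first look at the subspace intersection theorem, which the proofs of Weyl's inequality rely on.

Related posts

- Singular Value Decomposition: Commonly Used Results

- Singular Value Decomposition: The Courant-Fischer Theorem and Its Variants

- Singular Value Decomposition: Weyl's Inequality and Its Variants

- Singular Value Decomposition: Proof of Low-Rank Approximation in the Frobenius Norm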

I. Subspace Intersection

1. Subspace intersection lemma

[Subspace intersection lemma]^[1] Let $\mathit{V}$ be a finite-dimensional vector space and let $\mathit{S}_1$ and $\mathit{S}_2$ be two given subspaces of $\mathit{V}$ . Then
$\dim (\mathit{S}_1 \cap \mathit{S}_2) + \dim (\mathit{S}_1 + \mathit{S}_2) = \dim(\mathit{S}_1)+ \dim(\mathit{S}_2) \tag{I-1}$
Rewriting this identity and using $\dim (\mathit{S}_1 + \mathit{S}_2) \leq \dim \mathit{V}$ gives
$\begin{aligned} \dim (\mathit{S}_1 \cap \mathit{S}_2) &= \dim\mathit{S}_1+ \dim\mathit{S}_2 - \dim (\mathit{S}_1 + \mathit{S}_2)\\ &\geq \dim\mathit{S}_1+ \dim\mathit{S}_2 - \dim \mathit{V} \end{aligned} \tag{I-2}$
That is to say, if $\delta = \dim \mathit{S}_1 + \dim \mathit{S}_2 -\dim \mathit{V} \geq 1 $, then the subspace $\mathit{S}_1 \cap \mathit{S}_2$ has dimension at least $\delta$ .

If $\mathit{S}_1, \mathit{S}_2,\ldots ,\mathit{S}_k$ are subspaces of $\mathit{V}$ , an induction argument shows that
$\dim (\mathit{S}_1 \cap \mathit{S}_2 \cap\ldots \cap \mathit{S}_k) \geq \dim\mathit{S}_1+ \dim\mathit{S}_2 +\cdots + \dim\mathit{S}_k - (k-1) \dim \mathit{V} \tag{I-3}$
This shows that if $\delta = \dim \mathit{S}_1 + \dim \mathit{S}_2 + \dots + \dim \mathit{S}_k- (k-1)\dim \mathit{V} \geq 1$ , then $\dim(\mathit{S}_1 \cap \mathit{S}_2 \cap\ldots \cap \mathit{S}_k)\geq \delta$ . ( $k \geq 2$ )

Proof

Equation (I-1) is the standard dimension formula for subspaces, and (I-2) follows from it because $\mathit{S}_1 + \mathit{S}_2 \subseteq \mathit{V}$ .

We prove (I-3) by induction on $k$ . Suppose it holds for $k - 1$ subspaces, i.e.
$\dim (\mathit{S}_1 \cap \mathit{S}_2 \cap\ldots \cap \mathit{S}_{k-1}) \geq \dim\mathit{S}_1+ \dim\mathit{S}_2 +\cdots + \dim\mathit{S}_{k-1} - (k-2) \dim \mathit{V} \tag{I-4}$
Then for $k$ subspaces, (I-2) and (I-4) give
$\begin{aligned} \dim (\mathit{S}_1 \cap\ldots \cap \mathit{S}_{k-1} \cap \mathit{S}_{k}) & = \dim \left((\mathit{S}_1 \cap\ldots \cap \mathit{S}_{k-1}) \cap \mathit{S}_{k}\right)\\ {\small\text{(I-2)}} \quad &\geq \dim (\mathit{S}_1 \cap\ldots \cap \mathit{S}_{k-1} ) + \dim(\mathit{S}_{k}) - \dim \mathit{V}\\ {\small\text{(I-4)}} \quad & \geq \dim\mathit{S}_1 +\cdots + \dim\mathit{S}_{k-1} + \dim\mathit{S}_{k} - (k-1) \dim \mathit{V} \end{aligned}$
This completes the proof.
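The dimension identity (I-1) can be checked numerically. The sketch below is only an illustration under stated assumptions: the helper name `subspace_dims`, the random test subspaces, and the tolerance are all made up for this example. It measures $\dim(\mathit{S}_1 \cap \mathit{S}_2)$ independently of (I-1), by counting principal angles equal to zero.

```python
import numpy as np

def subspace_dims(B1, B2, tol=1e-8):
    """Return (dim S1, dim S2, dim(S1 + S2), dim(S1 intersect S2)).

    S1 and S2 are the column spaces of B1 and B2, which are assumed
    to have full column rank (hypothetical helper for this example).
    """
    Q1, _ = np.linalg.qr(B1)  # orthonormal basis of S1
    Q2, _ = np.linalg.qr(B2)  # orthonormal basis of S2
    dim_sum = np.linalg.matrix_rank(np.hstack([B1, B2]))
    # Singular values of Q1^T Q2 are the cosines of the principal angles;
    # a cosine equal to 1 marks a direction shared by S1 and S2.
    cosines = np.linalg.svd(Q1.T @ Q2, compute_uv=False)
    dim_cap = int(np.sum(cosines > 1.0 - tol))
    return B1.shape[1], B2.shape[1], int(dim_sum), dim_cap

rng = np.random.default_rng(0)
B1 = rng.standard_normal((5, 3))  # a 3-dimensional subspace of R^5
B2 = rng.standard_normal((5, 4))  # a 4-dimensional subspace of R^5
d1, d2, d_sum, d_cap = subspace_dims(B1, B2)
print(d1, d2, d_sum, d_cap)
```

Here $\delta = 3 + 4 - 5 = 2$, so (I-2) guarantees an intersection of dimension at least 2 for these two subspaces.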

2. Subspace intersection theorem

[Subspace intersection]^[1] Let $\mathit{S}_1, \ldots, \mathit{S}_k$ be given subspaces of $\mathbb{R}^n$ . If $\delta = \dim \mathit{S}_1 + \dots + \dim \mathit{S}_k - (k - 1)n \geq 1$ , there are orthonormal vectors $x_1, \ldots , x_{\delta}$ such that $x_1 , \ldots , x_\delta \in \mathit{S}_i$ for every $i = 1, \ldots , k$ . In particular, $\mathit{S}_1 \cap \ldots \cap \mathit{S}_k$ contains a unit vector.

Proof

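First, the set $\mathit{S}_1 \cap \ldots \cap \mathit{S}_k$ is a subspace.

By the subspace intersection lemma, $\dim(\mathit{S}_1 \cap \ldots \cap \mathit{S}_k) \geq \delta \geq 1$ . Take any orthonormal basis of the subspace $\mathit{S}_1 \cap \ldots \cap \mathit{S}_k$ and let $x_1 , \ldots , x_\delta$ be any $\delta$ of its elements.

This completes the proof.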
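As a rough illustration of the theorem, the following sketch builds orthonormal vectors lying in the intersection of $k = 3$ random subspaces. The function name `intersection_basis`, the projector-based construction, and the tolerance are assumptions of this example, not part of the cited proof.

```python
import numpy as np

def intersection_basis(bases, tol=1e-10):
    """Orthonormal basis (as columns) of the intersection of the column spaces in `bases`."""
    n = bases[0].shape[0]
    blocks = []
    for B in bases:
        Q, _ = np.linalg.qr(B)              # orthonormal basis of S_i
        blocks.append(np.eye(n) - Q @ Q.T)  # projector onto the orthogonal complement of S_i
    # x lies in every S_i exactly when (I - P_i) x = 0 for all i,
    # i.e. when x is in the null space of the stacked matrix.
    stacked = np.vstack(blocks)
    _, s, Vt = np.linalg.svd(stacked)
    return Vt[s < tol].T

rng = np.random.default_rng(1)
n, k = 6, 3
bases = [rng.standard_normal((n, 5)) for _ in range(k)]  # three 5-dimensional subspaces of R^6
delta = sum(B.shape[1] for B in bases) - (k - 1) * n     # 15 - 12 = 3
X = intersection_basis(bases)
print(delta, X.shape[1])
```

The columns of `X` are orthonormal by construction (they are right singular vectors), and the theorem guarantees that there are at least `delta` of them.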

II. Weyl's Inequality for Eigenvalues

[Weyl’s inequality]^[2] Let $\mathbf{M}= \mathbf{N}+\mathbf{R}$ , $\mathbf{N}$ , and $\mathbf{R}$ be $n\times n$ symmetric matrices, with their respective eigenvalues ordered as $\lambda_{1}(\mathbf{M})\geq \ldots\geq \lambda_{n}(\mathbf{M})$ , $\lambda_{1}(\mathbf{N})\geq \ldots\geq \lambda_{n}(\mathbf{N})$ , and $\lambda_{1}(\mathbf{R})\geq \ldots\geq \lambda_{n}(\mathbf{R})$ .

Then the following inequalities hold:
$\lambda_i(\mathbf{N})+\lambda_n(\mathbf{R}) \leq \lambda_{i}(\mathbf{M}) \leq \lambda_i(\mathbf{N})+\lambda_1(\mathbf{R}) \tag{II-0-1}$
for $i=1,\ldots,n$ .

More generally,
$\lambda_j(\mathbf{N})+\lambda_k(\mathbf{R}) \leq \lambda_{i}(\mathbf{M}) \leq \lambda_r(\mathbf{N})+\lambda_s(\mathbf{R}) \tag{II-0-2}$
for $j+k-n \geq i \geq r+s-1$ .

In particular, if $\mathbf{R}$ is positive definite then plugging $\lambda_n(\mathbf{R}) > 0$ into the above inequalities leads to
$\lambda_{i}(\mathbf{M}) > \lambda_i(\mathbf{N}) \tag{II-0-3}$
for $i=1,\dots,n$ .
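A brute-force numerical check of the general form (II-0-2) over all admissible index combinations might look as follows. This is a sketch only: the helper names and the random symmetric test matrices are invented for the illustration.

```python
import numpy as np

def eig_desc(S):
    """Eigenvalues of a symmetric matrix in decreasing order."""
    return np.linalg.eigvalsh(S)[::-1]

def random_symmetric(n, rng):
    G = rng.standard_normal((n, n))
    return (G + G.T) / 2

def check_weyl_eig(N, R, tol=1e-10):
    """Check (II-0-2) for every admissible 1-based index combination."""
    n = N.shape[0]
    lm, ln, lr = eig_desc(N + R), eig_desc(N), eig_desc(R)
    for i in range(1, n + 1):
        # Lower bound, valid when j + k - n >= i.
        for j in range(1, n + 1):
            for k in range(1, n + 1):
                if j + k - n >= i and lm[i - 1] < ln[j - 1] + lr[k - 1] - tol:
                    return False
        # Upper bound, valid when i >= r + s - 1.
        for r in range(1, n + 1):
            for s in range(1, n + 1):
                if i >= r + s - 1 and lm[i - 1] > ln[r - 1] + lr[s - 1] + tol:
                    return False
    return True

rng = np.random.default_rng(2)
N = random_symmetric(6, rng)
R = random_symmetric(6, rng)
print(check_weyl_eig(N, R))  # prints True
```

The special form (II-0-1) is covered by the index choices $j = i, k = n$ and $r = i, s = 1$ inside the same loops.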

Proof

We first prove the general form (II-0-2).

1. Proof of the general form

The general form (II-0-2) splits into two inequalities:

First inequality:
$\lambda_{i}(\mathbf{M})\geq \lambda_j(\mathbf{N})+\lambda_k(\mathbf{R}) , \quad (\text{for}\;j+k-n \geq i \geq 1) \tag{II-1-1}$
Second inequality:
$\lambda_{i}(\mathbf{M}) \leq \lambda_r(\mathbf{N})+\lambda_s(\mathbf{R}),\quad (\text{for}\;n \geq i \geq r+s-1 ) \tag{II-1-2}$

Here $i=1,\ldots,n$ , so clearly $n\geq i\geq 1$ .

A. Proof of the first inequality

We use the Courant-Fischer theorem: first construct special subspaces, then pass from these special subspaces to general ones, and finally complete the proof.

Since $\mathbf{M}$ is symmetric, it has orthonormal eigenvectors $\mathbf{m}_i$ associated with the eigenvalues $\lambda_i(\mathbf{M})$ ( $i=1,2,\ldots,n$ ). Likewise,

since $\mathbf{N}$ is symmetric, it has orthonormal eigenvectors $\mathbf{n}_i$ associated with the eigenvalues $\lambda_i(\mathbf{N})$ ( $i=1,2,\ldots,n$ ), and

since $\mathbf{R}$ is symmetric, it has orthonormal eigenvectors $\mathbf{r}_i$ associated with the eigenvalues $\lambda_i(\mathbf{R})$ ( $i=1,2,\ldots,n$ ).

In the vector space $\mathbb{R}^n$ , define two special subspaces
$\mathit{S}_n \triangleq {\rm span}\{\mathbf{n}_1, \mathbf{n}_2, \ldots, \mathbf{n}_j \}, \quad \dim{\mathit{S}_n}=j \tag{II-1-A-1}$

$\mathit{S}_r \triangleq {\rm span}\{\mathbf{r}_1, \mathbf{r}_2, \ldots, \mathbf{r}_k\}, \quad \dim{\mathit{S}_r}=k \tag{II-1-A-2}$

By the subspace intersection lemma, define the dimension of their intersection as
$v \triangleq \dim(\mathit{S}_n\cap \mathit{S}_r) = \dim\mathit{S}_n + \dim\mathit{S}_r - \dim(\mathit{S}_n+\mathit{S}_r) \tag{II-1-A-3}$
Since $\dim(\mathit{S}_n+\mathit{S}_r) \leq n$ , we have
$v \geq j+k-n \geq i \geq 1 \tag{II-1-A-4}$
Hence $\mathit{S}_n\cap \mathit{S}_r$ is a subspace of dimension at least 1, i.e. there exists a nonzero $x \in \mathit{S}_n\cap \mathit{S}_r$ .

Because the eigenvalues are in decreasing order,
$\lambda_{v}(\mathbf{M}) \leq \lambda_{j+k-n}(\mathbf{M}) \leq \lambda_{i}(\mathbf{M}) \tag{II-1-A-5}$
By Eq. (II-2) of the Courant-Fischer theorem (see the related post),
$\begin{aligned} \lambda_{v}(\mathbf{M}) & = \max_{\begin{array}{c}\mathit{S} \subseteq \mathbb{R}^n\\ {\rm dim}({\mathit{S}}) = v \end{array} } \min_{\begin{array}{c}\mathbf{x} \in \mathit{S}\\ \mathbf{x}\neq \mathbf{0} \end{array} } \frac{\mathbf{x}^{\small\rm T} \mathbf{M} \mathbf{x}}{\mathbf{x}^{\small\rm T} \mathbf{x}}\\ & \geq \min_{\begin{array}{c}\mathbf{x} \in \mathit{S_n}\cap \mathit{S_r}\\ \mathbf{x}\neq \mathbf{0} \end{array} } \frac{\mathbf{x}^{\small\rm T} \mathbf{M} \mathbf{x}}{\mathbf{x}^{\small\rm T} \mathbf{x}}\\ & \geq \min_{\begin{array}{c}\mathbf{x} \in \mathit{S_n}\cap \mathit{S_r}\\ \mathbf{x}\neq \mathbf{0} \end{array} } \frac{\mathbf{x}^{\small\rm T} \mathbf{N} \mathbf{x}}{\mathbf{x}^{\small\rm T} \mathbf{x}} + \min_{\begin{array}{c}\mathbf{x} \in \mathit{S_n}\cap \mathit{S_r}\\ \mathbf{x}\neq \mathbf{0} \end{array} } \frac{\mathbf{x}^{\small\rm T} \mathbf{R} \mathbf{x}}{\mathbf{x}^{\small\rm T} \mathbf{x}}\\ & \geq \min_{\begin{array}{c}\mathbf{x} \in \mathit{S_n}\\ \mathbf{x}\neq \mathbf{0} \end{array} } \frac{\mathbf{x}^{\small\rm T} \mathbf{N} \mathbf{x}}{\mathbf{x}^{\small\rm T} \mathbf{x}} + \min_{\begin{array}{c}\mathbf{x} \in \mathit{S_r}\\ \mathbf{x}\neq \mathbf{0} \end{array} } \frac{\mathbf{x}^{\small\rm T} \mathbf{R} \mathbf{x}}{\mathbf{x}^{\small\rm T} \mathbf{x}}\\ &\geq \lambda_j(\mathbf{N}) + \lambda_k(\mathbf{R}) \end{aligned}\tag{II-1-A-6}$
The steps of this derivation are explained as follows:

- The inequality between the first and second lines of (II-1-A-6) comes from shrinking the search space. The first line takes the minimum over each subspace of dimension $v$ and then the largest of these minima; the second line evaluates the minimum only on the particular $v$-dimensional subspace $\mathit{S}_n\cap \mathit{S}_r$ . The maximum over all candidate subspaces is at least the value obtained on any single one of them.

- The inequality between the second and third lines holds because the coupling $\mathbf{M}=\mathbf{N}+\mathbf{R}$ is relaxed: the two terms are minimized independently, which can only give a smaller value. In more detail, the vector that minimizes the Rayleigh quotient of $\mathbf{M}$ over $\mathit{S}_n\cap \mathit{S}_r$ is also feasible for the two separate minimizations of the Rayleigh quotients of $\mathbf{N}$ and $\mathbf{R}$ , so the sum of the two separate minima is at most the minimum for $\mathbf{M}$ . The converse fails, because the two separate minima are in general attained at different vectors.

- The inequality between the third and fourth lines holds because enlarging the search set of a minimization can only lower the minimum.

- The inequality between the fourth and fifth lines follows Eq. (II-1-A-3) of the Courant-Fischer theorem post; in essence it is the Rayleigh theorem.

Combining (II-1-A-5) and (II-1-A-6) gives
$\lambda_{i}(\mathbf{M}) \geq \lambda_j(\mathbf{N}) + \lambda_k(\mathbf{R}) , \quad (\text{for}\;j+k-n \geq i \geq 1) \tag{II-1-A-7}$
This completes the proof of the first inequality.
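Each link of the chain (II-1-A-6) can be evaluated numerically for a concrete choice of $j$ and $k$. The sketch below is an illustration under assumptions: the helper names, the sizes $n = 6, j = 5, k = 4$, and the tolerance are chosen only for this example. It uses the fact that the minimum of the Rayleigh quotient of $\mathbf{S}$ over a subspace with orthonormal basis $\mathbf{Q}$ equals the smallest eigenvalue of $\mathbf{Q}^{\rm T}\mathbf{S}\mathbf{Q}$.

```python
import numpy as np

def eigh_desc(S):
    """Eigenvalues (decreasing) and matching orthonormal eigenvectors of a symmetric matrix."""
    w, V = np.linalg.eigh(S)
    return w[::-1], V[:, ::-1]

def intersection_basis(Q1, Q2, tol=1e-10):
    """Orthonormal basis of span(Q1) intersect span(Q2); Q1, Q2 have orthonormal columns."""
    n = Q1.shape[0]
    stacked = np.vstack([np.eye(n) - Q1 @ Q1.T, np.eye(n) - Q2 @ Q2.T])
    _, s, Vt = np.linalg.svd(stacked)
    return Vt[s < tol].T

def min_rayleigh(S, Q):
    """Minimum of x^T S x / x^T x over nonzero x in span(Q), Q with orthonormal columns."""
    return np.linalg.eigvalsh(Q.T @ S @ Q)[0]

rng = np.random.default_rng(3)
n, j, k = 6, 5, 4                       # j + k - n = 3
G = rng.standard_normal((n, n)); N = (G + G.T) / 2
G = rng.standard_normal((n, n)); R = (G + G.T) / 2
M = N + R

lam_M, _ = eigh_desc(M)
lam_N, VN = eigh_desc(N)
lam_R, VR = eigh_desc(R)
Sn, Sr = VN[:, :j], VR[:, :k]           # S_n and S_r from (II-1-A-1), (II-1-A-2)
Q = intersection_basis(Sn, Sr)
v = Q.shape[1]                          # dimension of the intersection

chain = [
    lam_M[v - 1],                               # line 1: lambda_v(M)
    min_rayleigh(M, Q),                         # line 2: minimum over the intersection
    min_rayleigh(N, Q) + min_rayleigh(R, Q),    # line 3: separate minima
    min_rayleigh(N, Sn) + min_rayleigh(R, Sr),  # line 4: enlarged search sets
    lam_N[j - 1] + lam_R[k - 1],                # line 5: lambda_j(N) + lambda_k(R)
]
print(v, np.round(chain, 4))
```

The printed values should be non-increasing from left to right, one entry per line of (II-1-A-6).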

B. Proof of the second inequality

Construct the subspaces
$\mathit{S}^{'}_n \triangleq {\rm span}\{\mathbf{n}_r, \ldots, \mathbf{n}_n \}, \quad \dim{\mathit{S}^{'}_n}=n-r+1 \tag{II-1-B-1}$

$\mathit{S}^{'}_r \triangleq {\rm span}\{\mathbf{r}_s, \ldots, \mathbf{r}_n\}, \quad \dim{\mathit{S}^{'}_r}=n-s+1 \tag{II-1-B-2}$

By the subspace intersection lemma, define the dimension of their intersection as
$\begin{aligned} v^{'} &\triangleq \dim(\mathit{S}_n^{'}\cap \mathit{S}_r^{'}) \\ &= \dim(\mathit{S}_n^{'}) + \dim(\mathit{S}_r^{'}) - \dim(\mathit{S}_n^{'}+\mathit{S}_r^{'}) \\ &\geq (n-r+1) +(n-s+1)-n \\ &= n-(r+s-1)+1 \\ &\geq 1 \end{aligned} \tag{II-1-B-3}$
The last inequality above follows from the condition $n \geq i \geq r+s-1$ .

Since $v^{'} \geq 1$ , there exists a nonzero $y \in \mathit{S}_n^{'}\cap \mathit{S}_r^{'}$ .

Rearranging (II-1-B-3) and using $i \geq r+s-1$ gives
$i\geq r+s-1 \geq n-v^{'} + 1 \tag{II-1-B-4}$
Because the eigenvalues are in decreasing order,
$\lambda_{i} (\mathbf{M}) \leq \lambda_{r+s-1} (\mathbf{M}) \leq \lambda_{n-v^{'} + 1} (\mathbf{M}) \tag{II-1-B-5}$
By Eq. (II-3) of the Courant-Fischer theorem (see the related post),
$\begin{aligned} \lambda_{n-v^{'}+1} (\mathbf{M}) & = \min_{\begin{array}{c}\mathit{T} \subseteq \mathbb{R}^n\\ {\rm dim}({\mathit{T}}) = n-(n-v^{'}+1)+1 \end{array} } \max_{\begin{array}{c}\mathbf{x} \in \mathit{T}\\ \mathbf{x}\neq \mathbf{0} \end{array} } \frac{\mathbf{x}^{\small\rm T} \mathbf{M} \mathbf{x}}{\mathbf{x}^{\small\rm T} \mathbf{x}}\\ & = \min_{\begin{array}{c}\mathit{T} \subseteq \mathbb{R}^n\\ {\rm dim}({\mathit{T}}) = v^{'} \end{array} } \max_{\begin{array}{c}\mathbf{x} \in \mathit{T}\\ \mathbf{x}\neq \mathbf{0} \end{array} } \frac{\mathbf{x}^{\small\rm T} \mathbf{M} \mathbf{x}}{\mathbf{x}^{\small\rm T} \mathbf{x}}\\ & \leq \max_{\begin{array}{c}\mathbf{x} \in \mathit{S}_n^{'}\cap \mathit{S}_r^{'} \\ \mathbf{x}\neq \mathbf{0} \end{array} } \frac{\mathbf{x}^{\small\rm T} \mathbf{M} \mathbf{x}}{\mathbf{x}^{\small\rm T} \mathbf{x}}\\ &\leq \max_{\begin{array}{c}\mathbf{x} \in \mathit{S}_n^{'}\cap \mathit{S}_r^{'} \\ \mathbf{x}\neq \mathbf{0} \end{array} } \frac{\mathbf{x}^{\small\rm T} \mathbf{N} \mathbf{x}}{\mathbf{x}^{\small\rm T} \mathbf{x}} + \max_{\begin{array}{c}\mathbf{x} \in \mathit{S}_n^{'}\cap \mathit{S}_r^{'} \\ \mathbf{x}\neq \mathbf{0} \end{array} } \frac{\mathbf{x}^{\small\rm T} \mathbf{R} \mathbf{x}}{\mathbf{x}^{\small\rm T} \mathbf{x}}\\ &\leq \max_{\begin{array}{c}\mathbf{x} \in \mathit{S}_n^{'} \\ \mathbf{x}\neq \mathbf{0} \end{array} } \frac{\mathbf{x}^{\small\rm T} \mathbf{N} \mathbf{x}}{\mathbf{x}^{\small\rm T} \mathbf{x}} + \max_{\begin{array}{c}\mathbf{x} \in \mathit{S}_r^{'} \\ \mathbf{x}\neq \mathbf{0} \end{array} } \frac{\mathbf{x}^{\small\rm T} \mathbf{R} \mathbf{x}}{\mathbf{x}^{\small\rm T} \mathbf{x}}\\ & \leq \lambda_r(\mathbf{N}) + \lambda_s(\mathbf{R}) \end{aligned} \tag{II-1-B-6}$
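The steps of this derivation are explained as follows:

- The inequality between the second and third lines of (II-1-B-6) holds because the min-max is replaced by a max over one particular $v^{'}$-dimensional subspace, i.e. the search space of the outer minimization is shrunk to a single candidate.

- The inequality between the third and fourth lines holds because the coupling between the two terms is released (the constraint is relaxed), so a larger maximum can be reached.

- The inequality between the fourth and fifth lines holds because the search set of each maximization is enlarged.

- The inequality between the fifth and sixth lines follows Eq. (II-1-B-4) of the Courant-Fischer theorem post; in essence it is the Rayleigh theorem.

Combining (II-1-B-5) and (II-1-B-6) gives
$\lambda_{i} (\mathbf{M}) \leq \lambda_r(\mathbf{N}) + \lambda_s(\mathbf{R}), \qquad (n \geq i \geq r+s-1) \tag{II-1-B-7}$
This completes the proof of the second inequality.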

2. Proof of the special form

We now prove the special form (II-0-1). The general form has already been established:
$\lambda_j(\mathbf{N})+\lambda_k(\mathbf{R}) \leq \lambda_{i}(\mathbf{M}) \leq \lambda_r(\mathbf{N})+\lambda_s(\mathbf{R}) \quad (\text{for}\;\;j+k-n \geq i \geq r+s-1) \tag{II-2-1}$
Setting $j = i, k = n, r = i, s = 1$ in this inequality gives
$\lambda_i(\mathbf{N})+\lambda_n(\mathbf{R}) \leq \lambda_{i}(\mathbf{M}) \leq \lambda_i(\mathbf{N})+\lambda_1(\mathbf{R}) \tag{II-2-2}$
The index condition is also satisfied with this choice, since
$\begin{aligned} & j+k-n \geq i \geq r+s-1\\ \Rightarrow \quad& i+n-n \geq i \geq i+1-1\\ \Rightarrow \quad& i\geq i\geq i \end{aligned} \tag{II-2-3}$
This proves the special form (II-0-1).
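In perturbation analysis, (II-0-1) is often used through the following consequence: since $|\lambda_1(\mathbf{R})|$ and $|\lambda_n(\mathbf{R})|$ are both at most $\|\mathbf{R}\|_2$ , it gives $|\lambda_i(\mathbf{M}) - \lambda_i(\mathbf{N})| \leq \|\mathbf{R}\|_2$ for every $i$ . A minimal sketch of this check, with a hypothetical helper name and random test data:

```python
import numpy as np

def eig_shift_bound(N, R):
    """Return (max_i |lambda_i(N+R) - lambda_i(N)|, spectral norm of R) for symmetric N, R."""
    lam_M = np.linalg.eigvalsh(N + R)  # ascending order on both sides,
    lam_N = np.linalg.eigvalsh(N)      # so the entries are paired consistently
    return np.max(np.abs(lam_M - lam_N)), np.linalg.norm(R, 2)

rng = np.random.default_rng(4)
n = 6
G = rng.standard_normal((n, n)); N = (G + G.T) / 2
G = rng.standard_normal((n, n)); R = 1e-3 * (G + G.T) / 2  # a small symmetric perturbation
shift, bound = eig_shift_bound(N, R)
print(shift <= bound)  # prints True
```

Pairing the eigenvalues in ascending order is equivalent to pairing them in decreasing order, so the index convention of the theorem is respected.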

3. Proof for a positive definite perturbation

We now prove (II-0-3) for a positive definite perturbation. The special form (II-0-1) has already been established, and in particular
$\lambda_i(\mathbf{N})+\lambda_n(\mathbf{R}) \leq \lambda_{i}(\mathbf{M}) \tag{II-3-1}$
By assumption $\lambda_n(\mathbf{R}) > 0$ . Therefore
$\lambda_i(\mathbf{N}) < \lambda_i(\mathbf{N})+\lambda_n(\mathbf{R}) \leq \lambda_{i}(\mathbf{M}) \tag{II-3-2}$
This proves (II-0-3).
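The strict increase in (II-0-3) can be observed on a small example. In this sketch the positive definite $\mathbf{R}$ is built as $\mathbf{H}\mathbf{H}^{\rm T} + 0.1\,\mathbf{I}$ , which is an arbitrary choice made for the illustration.

```python
import numpy as np

def strictly_increases(N, R):
    """True if every eigenvalue of N + R exceeds the matching eigenvalue of N."""
    lam_M = np.linalg.eigvalsh(N + R)
    lam_N = np.linalg.eigvalsh(N)
    return bool(np.all(lam_M > lam_N))

rng = np.random.default_rng(5)
n = 5
G = rng.standard_normal((n, n))
N = (G + G.T) / 2
H = rng.standard_normal((n, n))
R = H @ H.T + 0.1 * np.eye(n)  # positive definite, smallest eigenvalue at least 0.1
print(strictly_increases(N, R))  # prints True
```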

This completes the proof of Weyl's inequality for eigenvalues.

III. Weyl's Inequality for Singular Values

[Weyl’s inequality]^[3] Let $\mathbf{A}, \mathbf{B}\in \mathbf{M}_{m, n}$ be given and let $q = \min\{m,n\}$ . The following inequality holds for the decreasingly ordered singular values of $\mathbf{A}$ , $\mathbf{B}$ , and $\mathbf{A}+ \mathbf{B}$ :
$\sigma_{i+j-1} (\mathbf{A} + \mathbf{B}) \leq \sigma_i(\mathbf{A})+ \sigma_j(\mathbf{B}) \tag{III-1}$
for $1 \leq i,j \leq q$ and $i+j \leq q+1$ .
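The inequality (III-1) can be checked by brute force over all admissible index pairs. The helper name and the random rectangular test matrices below are assumptions of this sketch.

```python
import numpy as np

def check_weyl_sv(A, B, tol=1e-10):
    """Check sigma_{i+j-1}(A+B) <= sigma_i(A) + sigma_j(B) for 1 <= i, j and i + j <= q + 1."""
    q = min(A.shape)
    sa = np.linalg.svd(A, compute_uv=False)       # singular values in decreasing order
    sb = np.linalg.svd(B, compute_uv=False)
    sab = np.linalg.svd(A + B, compute_uv=False)
    for i in range(1, q + 1):
        for j in range(1, q + 2 - i):             # keeps i + j <= q + 1
            if sab[i + j - 2] > sa[i - 1] + sb[j - 1] + tol:
                return False
    return True

rng = np.random.default_rng(6)
A = rng.standard_normal((4, 6))
B = rng.standard_normal((4, 6))
print(check_weyl_sv(A, B))  # prints True
```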

Proof^[3]

The proof is similar to "B. Proof of the second inequality".

Let the singular value decompositions of $\mathbf{A}$ and $\mathbf{B}$ be
$\mathbf{A} = \mathbf{V}\boldsymbol{\Sigma}_{A} \mathbf{W}^{\small\rm T}\tag{III-2}$
where $\mathbf{W} = \begin{bmatrix} \mathbf{w}_1, \ldots, \mathbf{w}_n \end{bmatrix}$ is an $n\times n$ orthogonal matrix, and
$\mathbf{B} = \mathbf{X}\boldsymbol{\Sigma}_{B} \mathbf{Y}^{\small\rm T}\tag{III-3}$
where $\mathbf{Y} = \begin{bmatrix} \mathbf{y}_1, \ldots, \mathbf{y}_n \end{bmatrix}$ is an $n\times n$ orthogonal matrix.

Define the subspaces
$\mathit{S}_w \triangleq {\rm span}\{\mathbf{w}_i, \ldots, \mathbf{w}_n\}, \;\;\;\dim\mathit{S}_w = n-i+1 \tag{III-4}$

$\mathit{S}_y \triangleq {\rm span}\{\mathbf{y}_j, \ldots, \mathbf{y}_n\}, \;\;\;\dim\mathit{S}_y = n-j+1 \tag{III-5}$

Define the dimension of their intersection as
$\begin{aligned} v &\triangleq \dim (\mathit{S}_w \cap \mathit{S}_y)\\ &= \dim \mathit{S}_w + \dim \mathit{S}_y - \dim (\mathit{S}_w + \mathit{S}_y)\\ & \geq (n-i+1) + (n-j+1) -n\\ &= n-(i+j-1)+1 \\ &\geq n-(q+1-1)+1\\ &\geq 1 \end{aligned}\tag{III-6}$
This uses the subspace intersection lemma together with the conditions $i+j \leq q+1$ and $q = \min\{m,n\}$ .

Since $v \geq 1$ , there exists a nonzero $\mathbf{x} \in \mathit{S}_w \cap \mathit{S}_y$ .

Rearranging (III-6) gives
$i+j-1 \geq n-v+1 \tag{III-7}$
Because the singular values are in decreasing order,
$\sigma_{i+j-1} (\mathbf{A}+\mathbf{B}) \leq \sigma_{n-v+1}(\mathbf{A}+\mathbf{B}) \tag{III-8}$
By the Courant-Fischer theorem for singular values,
$\begin{aligned} \sigma_{n-v+1}(\mathbf{A}+\mathbf{B}) &= \min_{\begin{array}{c}\mathit{S} \subseteq \mathbb{R}^n\\ {\rm dim}({\mathit{S}}) = v \end{array} } \max_{\begin{array}{c}\mathbf{x} \in \mathit{S}\\ \|\mathbf{x}\|_2 = 1 \end{array} } \| (\mathbf{A}+\mathbf{B}) \mathbf{x}\|_2\\ & \leq \max_{\begin{array}{c}\mathbf{x} \in \mathit{S}_w\cap \mathit{S}_y \\ \|\mathbf{x}\|_2 = 1 \end{array} } \| (\mathbf{A}+\mathbf{B}) \mathbf{x}\|_2\\ & \leq \max_{\begin{array}{c}\mathbf{x} \in \mathit{S}_w\cap \mathit{S}_y \\ \|\mathbf{x}\|_2 = 1 \end{array} } \left(\| \mathbf{A}\mathbf{x}\|_2 + \| \mathbf{B} \mathbf{x}\|_2\right)\\ & \leq \max_{\begin{array}{c}\mathbf{x} \in \mathit{S}_w\cap \mathit{S}_y \\ \|\mathbf{x}\|_2 = 1 \end{array} } \| \mathbf{A}\mathbf{x}\|_2 + \max_{\begin{array}{c}\mathbf{x} \in \mathit{S}_w\cap \mathit{S}_y \\ \|\mathbf{x}\|_2 = 1 \end{array} } \| \mathbf{B} \mathbf{x}\|_2\\ & \leq \max_{\begin{array}{c}\mathbf{x} \in \mathit{S}_w \\ \|\mathbf{x}\|_2 = 1 \end{array} } \| \mathbf{A}\mathbf{x}\|_2 + \max_{\begin{array}{c}\mathbf{x} \in \mathit{S}_y \\ \|\mathbf{x}\|_2 = 1 \end{array} } \| \mathbf{B} \mathbf{x}\|_2\\ & = \sigma_i(\mathbf{A}) + \sigma_j(\mathbf{B}) \end{aligned} \tag{III-9}$
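The steps of this derivation are explained as follows:

- The inequality between the first and second lines of (III-9) holds because the min-max is replaced by a max over one particular $v$-dimensional subspace, i.e. the search space of the outer minimization is shrunk to a single candidate.

- The inequality between the second and third lines is the triangle inequality for the 2-norm.

- The inequality between the third and fourth lines holds because the coupling between the two terms is released (the constraint is relaxed), so a larger maximum can be reached.

- The inequality between the fourth and fifth lines holds because the search set of each maximization is enlarged.

- The equality between the fifth and sixth lines is in essence the Rayleigh theorem, briefly justified as follows:
$\max_{\begin{array}{c}\mathbf{x} \in \mathit{S}_w \\ \|\mathbf{x}\|_2 = 1 \end{array} } \| \mathbf{A} \mathbf{x}\|_2^2 = \max_{\begin{array}{c}\mathbf{x} \in \mathit{S}_w \\ \|\mathbf{x}\|_2 = 1 \end{array} } \mathbf{x}^{\small\rm T} \mathbf{A}^{\small\rm T} \mathbf{A} \mathbf{x} \leq \lambda_i(\mathbf{A}^{\small\rm T} \mathbf{A}) = \sigma_i (\mathbf{A})^2 \tag{III-10}$
As shown in the post "Singular Value Decomposition: Commonly Used Results", $\mathbf{W}$ is the orthogonal eigenvector matrix of $\mathbf{A}^{\small\rm T} \mathbf{A}$ . Since $\mathbf{x} \in \mathit{S}_w$ , the maximum $\| \mathbf{A} \mathbf{x}\|_2^2 = \sigma_i (\mathbf{A})^2$ is attained at $\mathbf{x} = \mathbf{w}_i$ , so the bound in (III-10) holds with equality.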
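The last step of (III-9) can be checked numerically: over unit vectors in $\mathit{S}_w = {\rm span}\{\mathbf{w}_i, \ldots, \mathbf{w}_n\}$ , the maximum of $\|\mathbf{A}\mathbf{x}\|_2$ is the spectral norm of $\mathbf{A}$ restricted to that subspace, and it should equal $\sigma_i(\mathbf{A})$ . The function name and test matrix in this sketch are made up for the illustration.

```python
import numpy as np

def max_norm_on_tail(A, i):
    """max ||A x||_2 over unit x in span{w_i, ..., w_n} (1-based i), W from the SVD of A."""
    _, _, Wt = np.linalg.svd(A)  # full SVD: the rows of Wt are w_1^T, ..., w_n^T
    W_tail = Wt[i - 1:].T        # orthonormal basis of S_w
    # For x = W_tail c with ||c||_2 = 1, the maximum of ||A x||_2
    # is the spectral norm of A W_tail.
    return np.linalg.norm(A @ W_tail, 2)

rng = np.random.default_rng(7)
A = rng.standard_normal((4, 6))  # m = 4, n = 6, q = 4
sigma = np.linalg.svd(A, compute_uv=False)
values = [max_norm_on_tail(A, i) for i in range(1, 5)]
print(np.allclose(values, sigma))  # prints True
```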

Combining (III-8) and (III-9) gives
$\sigma_{i+j-1} (\mathbf{A}+\mathbf{B}) \leq \sigma_i(\mathbf{A}) + \sigma_j(\mathbf{B}) \tag{III-11}$
This completes the proof of Weyl's inequality for singular values.
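One consequence that connects to the low-rank approximation mentioned in the introduction: if ${\rm rank}(\mathbf{B}) \leq k$ then $\sigma_{k+1}(\mathbf{B}) = 0$ , so applying (III-1) to $(\mathbf{A}-\mathbf{B}) + \mathbf{B}$ with $j = k+1$ gives $\sigma_{i+k}(\mathbf{A}) \leq \sigma_i(\mathbf{A}-\mathbf{B})$ whenever $i + k \leq q$ . The sketch below checks this on random data; the helper name and the way the rank-$k$ matrix is generated are assumptions of the example.

```python
import numpy as np

def check_low_rank_shift(A, B, k, tol=1e-10):
    """Check sigma_{i+k}(A) <= sigma_i(A - B) for all i with i + k <= q, given rank(B) <= k."""
    q = min(A.shape)
    sa = np.linalg.svd(A, compute_uv=False)
    sd = np.linalg.svd(A - B, compute_uv=False)
    return all(sa[i + k - 1] <= sd[i - 1] + tol for i in range(1, q - k + 1))

rng = np.random.default_rng(8)
m, n, k = 5, 7, 2
A = rng.standard_normal((m, n))
B = rng.standard_normal((m, k)) @ rng.standard_normal((k, n))  # rank at most k
print(check_low_rank_shift(A, B, k))  # prints True
```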

Summary

This post collected and proved two forms of Weyl's inequality:

- the eigenvalue form

- the singular value form

(Corrections are welcome.)

References

[1] Roger A. Horn, Charles R. Johnson, Matrix Analysis, Second Edition, Cambridge University Press, 2012

[2] HandWiki, “Weyl’s inequality”, https://handwiki.org/wiki/Weyl%27s_inequality

[3] Horn, R., Johnson, C., Topics in Matrix Analysis, Cambridge University Press, 1991
