矩阵低秩近似Eckart-Young Theorem

Nightmare004

已于 2023-07-31 11:16:40 修改

阅读量4.6k

点赞数 9

分类专栏：数学文章标签：矩阵线性代数

于 2022-03-10 16:24:03 首次发布

本文链接：https://blog.csdn.net/qq_39942341/article/details/123403100

版权

数学专栏收录该内容

145 篇文章

订阅专栏

引理1

设 $\mathbf{U},\mathbf{V}$ 是酉矩阵
$\|\mathbf{U}\mathbf{A}\|_2=\|\mathbf{A}\mathbf{V}\|_2=\|\mathbf{U}\mathbf{A}\mathbf{V}\|_2=\|\mathbf{A}\|_2$
证明：
利用 $\|\mathbf{A}\|_2=\sqrt{\lambda_{max}\left(\mathbf{A}^H\mathbf{A}\right)}$ 就很显然了

引理2

设 $\mathbf{A},\mathbf{B}\in\mathbb{C}^{m\times n}$ , $q=\min\left\{m,n\right\}$
$\sigma_1\ge\sigma_2\ge \cdots\ge\sigma_q$ 代表奇异值
则
$\sigma_{i+j-1}\left(\mathbf{A}+\mathbf{B}\right)\le \sigma_i\left(\mathbf{A}\right)+\sigma_j\left(\mathbf{B}\right)$
其中 $1\le i,j\le q,i+j\le q+1$
证明：
对 $\mathbf{A},\mathbf{B}$ 做SVD分解
$\mathbf{A}=\mathbf{V}\mathbf{\Sigma}_{\mathbf{A}}\mathbf{W}^H$
$\mathbf{B}=\mathbf{X}\mathbf{\Sigma}_{\mathbf{B}}\mathbf{Y}^H$
设
$\mathbf{W}=\left(\mathbf{w}_1,\cdots,\mathbf{w}_n\right),\mathbf{Y}=\left(\mathbf{y}_1,\cdots,\mathbf{y}_n\right)$
$\mathbf{V}=\left(\mathbf{w}_1,\cdots,\mathbf{w}_m\right),\mathbf{X}=\left(\mathbf{y}_1,\cdots,\mathbf{y}_m\right)$

设 $\mathbf{S}'=\operatorname{span}\left\{\mathbf{w}_i,\cdots,\mathbf{w}_n\right\}$
$\mathbf{S}''=\operatorname{span}\left\{\mathbf{y}_i,\cdots,\mathbf{y}_n\right\}$
注意到
$\begin{aligned} v&=\operatorname{dim}\left(\mathbf{S}'\cap\mathbf{S}''\right)\\ &=\operatorname{dim}\left(\mathbf{S}'\right)+\operatorname{dim}\left(\mathbf{S}''\right)-\operatorname{dim}\left(\mathbf{S}'\cup\mathbf{S}''\right)\\ &=n-i+1+n-j+1-\operatorname{dim}\left(\mathbf{S}'\cup\mathbf{S}''\right)\\ &\ge n-i+1+n-j+1-n\\ &=n-i+1-j+1\\ &\ge1 \end{aligned}$
利用Min-max theorem
$n-i+1+n-j+1-v=\operatorname{dim}\left(\mathbf{S}'\cup\mathbf{S}''\right)<=n\Rightarrow n-v+1\le i+j-1$
所以
$\begin{aligned} \sigma_{i+j-1}\left(\mathbf{A}+\mathbf{B}\right)&\le\sigma_{n-v+1}\left(\mathbf{A}+\mathbf{B}\right)\\ &=\min_{\mathbf{S}\in\mathbb{C}\atop \operatorname{dim}\left(\mathbf{S}\right)=v}\max_{\mathbf{x}\in\mathbf{S}\atop\|\mathbf{x}\|=1}\|\left(\mathbf{A}+\mathbf{B}\right)\mathbf{x}\|\\ &\le\max_{\mathbf{x}\in\mathbf{S}'\cap \mathbf{S}''\atop\|\mathbf{x}\|=1}\|\left(\mathbf{A}+\mathbf{B}\right)\mathbf{x}\|\\ &\le\max_{\mathbf{x}\in\mathbf{S}'\cap \mathbf{S}''\atop\|\mathbf{x}\|=1}\|\mathbf{A}\mathbf{x}\|+\max_{\mathbf{x}\in\mathbf{S}'\cap \mathbf{S}''\atop\|\mathbf{x}\|=1}\|\mathbf{B}\mathbf{x}\|\\ &\le\max_{\mathbf{x}\in\mathbf{S}'\atop\|\mathbf{x}\|=1}\|\mathbf{A}\mathbf{x}\|+\max_{\mathbf{x}\in\mathbf{S}''\atop\|\mathbf{x}\|=1}\|\mathbf{B}\mathbf{x}\|\\ &=\sigma_i\left(\mathbf{A}\right)+\sigma_j\left(\mathbf{B}\right) \end{aligned}$

Eckart-Young Theorem

设矩阵 $\mathbf{A}$ 有SVD分解 $\mathbf{A}=\mathbf{U}\mathbf{\Sigma}\mathbf{V}^T$ ,其中 $\mathbf{U},\mathbf{V}$ 为正交矩阵
设 $k<r=\operatorname{rank}\left(\mathbf{A}\right)$
$\mathbf{A}_k=\sum_{i=1}^{k}\sigma_i\mathbf{u}_i\mathbf{v}_i^T$
其中 $\sigma_i$ 为 $\mathbf{A}$ 的奇异值，设 $\mathbf{A}$ 有 $p$ 个奇异值
$\sigma_1\ge \sigma_2\ge \cdots\ge \sigma_r>\sigma_{r+1}=\cdots=\sigma_p=0$
则
$\min_{\operatorname{rank}\left(\mathbf{B}\right)=k}\|\mathbf{A}-\mathbf{B}\|_2=\|\mathbf{A}-\mathbf{A}_k\|_2=\sigma_{k+1}$
$\min_{\operatorname{rank}\left(\mathbf{B}\right)=k}\|\mathbf{A}-\mathbf{B}\|_F=\|\mathbf{A}-\mathbf{A}_k\|_F=\sqrt{\sum_{i=k+1}^{p}\sigma_i^2}$

证明

二范数形式

$\mathbf{A}=\sum_{i=1}^{p}\sigma_i\mathbf{u}_i\mathbf{v}_i^T$
$\mathbf{A}_k=\sum_{i=1}^{k}\sigma_i\mathbf{u}_i\mathbf{v}_i^T$
$\mathbf{A}-\mathbf{A}_k$ 的前 $k$ 个奇异值为0，剩下的 $p - k$ 个奇异值为 $\sigma_{k+1},\cdots,\sigma_{p}$
因为 $\|\mathbf{A}\|_2=\sigma_1=\sigma_{max}$
于是 $\|\mathbf{A}-\mathbf{A}_{k+1}\|_2=\sigma_{k+1}$

接着证明其他的解>=最优解
因为 $\operatorname{rank}\left(\mathbf{B}\right)=k$
$\operatorname{dim}N\left(\mathbf{B}\right)=n-k$
于是存在 $n - k$ 个标准正交向量 $\mathbf{x}_1,\cdots,\mathbf{x}_{n-k}$ ,使得
$N\left(\mathbf{B}\right)=span\left\{\mathbf{x}_1,\cdots,\mathbf{x}_{n-k}\right\}$
又因为
$\operatorname{dim}span\left\{\mathbf{x}_1,\cdots,\mathbf{x}_{n-k}\right\}+\operatorname{dim}span\left\{\mathbf{v}_1,\cdots,\mathbf{v}_{k+1}\right\}=n+1>n$
所以
$span\left\{\mathbf{x}_1,\cdots,\mathbf{x}_{n-k}\right\}\cap span\left\{\mathbf{v}_1,\cdots,\mathbf{v}_{k+1}\right\}\neq\left\{0\right\}$
存在 $\mathbf{z}\left(\|\mathbf{z}\|=1\right)$ ,使得 $\mathbf{Bz}=0$ 且 $\mathbf{z}\in span\left\{\mathbf{v}_1,\cdots,\mathbf{v}_{k+1}\right\}$
$\mathbf{z}=\sum_{i=1}^{k+1}k_i\mathbf{v}_i$ 并且 $\sum_{i=1}^{k+1}k_i^2=1$

$\begin{aligned} &\|\mathbf{A}-\mathbf{B}\|_2\\ =&\|\mathbf{A}-\mathbf{B}\|_2\|\mathbf{z}\|\\ \ge&\|\left(\mathbf{A}-\mathbf{B}\right)\mathbf{z}\|\\ =&\|\mathbf{Az}\|\\ =&\|\mathbf{U}\mathbf{\Sigma}\mathbf{V}^T\mathbf{z}\|\\ =&\|\mathbf{\Sigma}\mathbf{V}^T\mathbf{z}\|\\ =&\sum_{i=1}^{p}\left(\sigma_i\mathbf{v}_i^T\mathbf{z}\right)^2\\ =&\sum_{i=1}^{k+1}\left(\sigma_i\mathbf{v}_i^T\mathbf{z}\right)^2\\ \ge&\sigma_{k+1}\sum_{i=1}^{k+1}\left(\mathbf{v}_i^T\mathbf{z}\right)^2\\ =&\sigma_{k+1}\sum_{i=1}^{k+1}k_i^2\\ =&\sigma_{k+1} \end{aligned}$

F范数形式

利用引理2
$\begin{aligned} &\|\mathbf{A}-\mathbf{A}_k\|_F^2\\ =&\sum_{i=k+1}^r\sigma_i\left(\mathbf{A}\right)\\ =&\sum_{i=k+1}^r\sigma_i\left(\mathbf{A}-\mathbf{B}+\mathbf{B}\right)\\ \le&\sum_{i=k+1}^r\left(\sigma_{i-k}\left(\mathbf{A}-\mathbf{B}\right)+\sigma_{k+1}\left(\mathbf{B}\right)\right)\\ =&\sum_{i=k+1}^r\sigma_{i-k}\left(\mathbf{A}-\mathbf{B}\right)\\ \le&\sum_{i=1}^{r-k}\sigma_{i}\left(\mathbf{A}-\mathbf{B}\right)\\ \le&\|\mathbf{A}-\mathbf{B}\|_F^2 \end{aligned}$
所以成立