F范数低秩矩阵近似高秩矩阵_最佳frobenius低秩逼近证明-CSDN博客

本文链接：https://blog.csdn.net/qq_40379678/article/details/105259970

F范数定义

$\|A\|=\sqrt{\sum_{m,n=1,1}^{M, N}a_{m,n}^2}$

SVD-singular value decomposition

$A$ is a $m\times n$ matrix.
$A=U\Sigma V^*=U\begin{bmatrix} \Sigma_k&\mathbf{0} \\\mathbf{0}&\mathbf{0} \end{bmatrix}V^*$
$k$ is the rank of $A$ .
Then we have:
$\begin{aligned} \|A\| &=\|U\Sigma V^*\|\\ &=\sqrt{Tr((U\Sigma V^*)(U\Sigma V^*)^T)}\\ &= \sqrt{Tr(U\Sigma V^*V\Sigma^TU^T)}\\&=\sqrt{\sum_{m=1}^k\sigma_m^2}=\|\Sigma\| \end{aligned}$
Now we can form a rank $\le k$ matrix $\hat{A}$ by setting the value $\delta_{r+1},\cdots, \delta_{k}$ in $\Sigma_k$ to be zero, namely:
$\hat{A}=U\begin{bmatrix} \Sigma_r&\mathbf{0} \\\mathbf{0}&\mathbf{0} \end{bmatrix}V^*$
Then the Frobenius norm $\varepsilon_r$ of the error matrix $(A-\hat{A})$ is given by:
$\begin{aligned} \varepsilon_r&=\|(A-\hat{A})\| \\ &=\| U(\Sigma-\hat{\Sigma})V^*)\| \\ &= \| \Sigma-\hat{\Sigma}\| \\&=\sqrt{\sum_{m=r+1}^k\sigma_m^2} \end{aligned}$
Assume $B$ is already a matrix giving the minimum value of $\varepsilon_B$ and its singular value decomposition is given by:
$B=U_b\Sigma_b V^*_b=U_b\begin{bmatrix} \Sigma_b&\mathbf{0} \\\mathbf{0}&\mathbf{0} \end{bmatrix}V^*_b$
Now we define a new matrix $C$ which is given by:
$\boldsymbol{C}=\boldsymbol{U}_{b}^{H} \boldsymbol{A} \boldsymbol{V}_{b}=\left[\begin{array}{cc} \boldsymbol{C}_{11} & \boldsymbol{C}_{12} \\ \boldsymbol{C}_{21} & \boldsymbol{C}_{22} \end{array}\right]$

Then we have:
$\begin{aligned} \varepsilon_{B} &=\|\boldsymbol{A}-\boldsymbol{B}\| \\ &=\left\|\boldsymbol{U}_{b}^{H}(\boldsymbol{A}-\boldsymbol{B}) \boldsymbol{V}_{b}\right\| \\ &=\left\|\boldsymbol{C}-\boldsymbol{\Sigma}_{b}\right\| \\ &=\left\|\boldsymbol{C}_{11}-\hat{\mathbf{\Sigma}}_{b}\right\|+\left\|\boldsymbol{C}_{12}\right\|+\left\|\boldsymbol{C}_{21}\right\|+\left\|\boldsymbol{C}_{22}\right\| \end{aligned}$

since $\boldsymbol{B}$ is already a matrix giving the minimum value of $\varepsilon_{B},$ we must have $\boldsymbol{C}_{12}=\mathbf{0}$ Otherwise, we will be able to construct a new rank $r$ matrix $\hat{\boldsymbol{B}}$ , given by:
$\hat{\boldsymbol{B}}=\boldsymbol{U}_{b}\left[\begin{array}{cc} \hat{\boldsymbol{\Sigma}}_{b} & \boldsymbol{C}_{12} \\ \boldsymbol{0} & \boldsymbol{0} \end{array}\right] \boldsymbol{V}_{b}^{H}$
so that the new Frobenius norm:
$\begin{aligned} \varepsilon_{\hat{B}} &=\|\boldsymbol{A}-\hat{\boldsymbol{B}}\| \\ &=\left\|\boldsymbol{U}_{b}^{H}(\boldsymbol{A}-\hat{\boldsymbol{B}}) \boldsymbol{V}_{b}\right\| \\ &=\left\|\boldsymbol{C}_{11}-\hat{\mathbf{\Sigma}}_{b}\right\|+\left\|\boldsymbol{C}_{21}\right\|+\left\|\boldsymbol{C}_{22}\right\| \end{aligned}$
will be smaller than $\varepsilon_{B},$ which contradicts the assumption that $B$ gives the minimum value. In the same way, we have $C_{21}=0$ and $C_{11}=\hat{\mathbf{\Sigma}}_{b} .$ Then we have:
$\boldsymbol{C}=\boldsymbol{U}_{b}^{H} \boldsymbol{A} \boldsymbol{V}_{b}=\left[\begin{array}{cc} \hat{\boldsymbol{\Sigma}}_{b} & \boldsymbol{0} \\ \boldsymbol{0} & \boldsymbol{C}_{22} \end{array}\right]$
since $\hat{\mathbf{\Sigma}}_{b}$ is diagonal, it consists of $r$ singular values of $\boldsymbol{A} .$ We can get:
$\begin{aligned} \varepsilon_{B} &=\left\|\boldsymbol{C}-\boldsymbol{\Sigma}_{b}\right\| \\ &=\left\|\boldsymbol{C}_{22}\right\| \end{aligned}$
since both $U_{b}$ and $V_{b}$ are unitary matrices, we have:
$\|\boldsymbol{A}\|^{2}=\|\boldsymbol{C}\|^{2}=\left\|\hat{\boldsymbol{\Sigma}}_{b}\right\|^{2}+\left\|\boldsymbol{C}_{22}\right\|^{2}$
Then:
$\begin{aligned} \left\|c_{x}\right\|^{2} &=\|A\|^{2}-\left\|\hat{\boldsymbol{\Sigma}}_{b}\right\|^{2}\\ &=\sum_{m=1}^{k} \sigma_{m}^{2}-\left\|\hat{\boldsymbol{\Sigma}}_{b}\right\|^{2} \end{aligned}$
Obviously, when $\hat{\mathbf{\Sigma}}_{b}$ holds the $r$ largest singular values $\sigma_{1}, \ldots, \sigma_{r}$ of the matrix $\boldsymbol{A}$ $\left\|\boldsymbol{C}_{22}\right\|^{2}$ and then $\varepsilon_{B}$ reaches its minimum value:
$\varepsilon_{B}^{2}=\left.\left|\boldsymbol{C}_{22}\right|\right|^{2}=\sum_{m=r+1}^{k} \sigma_{m}^{2}=\varepsilon_{r}^{2}$
Therefore, we can draw the conclusion that $\hat{A}$ is the best rank $r$ approximation to $A$ based on minimisation of the error matrix’ Frobenius norm $\|\boldsymbol{A}-\boldsymbol{B}\|$