引理1
设
U
,
V
\mathbf{U},\mathbf{V}
U,V是酉矩阵
∥
U
A
∥
2
=
∥
A
V
∥
2
=
∥
U
A
V
∥
2
=
∥
A
∥
2
\|\mathbf{U}\mathbf{A}\|_2=\|\mathbf{A}\mathbf{V}\|_2=\|\mathbf{U}\mathbf{A}\mathbf{V}\|_2=\|\mathbf{A}\|_2
∥UA∥2=∥AV∥2=∥UAV∥2=∥A∥2
证明:
利用
∥
A
∥
2
=
λ
m
a
x
(
A
H
A
)
\|\mathbf{A}\|_2=\sqrt{\lambda_{max}\left(\mathbf{A}^H\mathbf{A}\right)}
∥A∥2=λmax(AHA)就很显然了
引理2
设
A
,
B
∈
C
m
×
n
\mathbf{A},\mathbf{B}\in\mathbb{C}^{m\times n}
A,B∈Cm×n,
q
=
min
{
m
,
n
}
q=\min\left\{m,n\right\}
q=min{m,n}
σ
1
≥
σ
2
≥
⋯
≥
σ
q
\sigma_1\ge\sigma_2\ge \cdots\ge\sigma_q
σ1≥σ2≥⋯≥σq代表奇异值
则
σ
i
+
j
−
1
(
A
+
B
)
≤
σ
i
(
A
)
+
σ
j
(
B
)
\sigma_{i+j-1}\left(\mathbf{A}+\mathbf{B}\right)\le \sigma_i\left(\mathbf{A}\right)+\sigma_j\left(\mathbf{B}\right)
σi+j−1(A+B)≤σi(A)+σj(B)
其中
1
≤
i
,
j
≤
q
,
i
+
j
≤
q
+
1
1\le i,j\le q,i+j\le q+1
1≤i,j≤q,i+j≤q+1
证明:
对
A
,
B
\mathbf{A},\mathbf{B}
A,B做SVD分解
A
=
V
Σ
A
W
H
\mathbf{A}=\mathbf{V}\mathbf{\Sigma}_{\mathbf{A}}\mathbf{W}^H
A=VΣAWH
B
=
X
Σ
B
Y
H
\mathbf{B}=\mathbf{X}\mathbf{\Sigma}_{\mathbf{B}}\mathbf{Y}^H
B=XΣBYH
设
W
=
(
w
1
,
⋯
,
w
n
)
,
Y
=
(
y
1
,
⋯
,
y
n
)
\mathbf{W}=\left(\mathbf{w}_1,\cdots,\mathbf{w}_n\right),\mathbf{Y}=\left(\mathbf{y}_1,\cdots,\mathbf{y}_n\right)
W=(w1,⋯,wn),Y=(y1,⋯,yn)
V
=
(
w
1
,
⋯
,
w
m
)
,
X
=
(
y
1
,
⋯
,
y
m
)
\mathbf{V}=\left(\mathbf{w}_1,\cdots,\mathbf{w}_m\right),\mathbf{X}=\left(\mathbf{y}_1,\cdots,\mathbf{y}_m\right)
V=(w1,⋯,wm),X=(y1,⋯,ym)
设
S
′
=
span
{
w
i
,
⋯
,
w
n
}
\mathbf{S}'=\operatorname{span}\left\{\mathbf{w}_i,\cdots,\mathbf{w}_n\right\}
S′=span{wi,⋯,wn}
S
′
′
=
span
{
y
i
,
⋯
,
y
n
}
\mathbf{S}''=\operatorname{span}\left\{\mathbf{y}_i,\cdots,\mathbf{y}_n\right\}
S′′=span{yi,⋯,yn}
注意到
v
=
dim
(
S
′
∩
S
′
′
)
=
dim
(
S
′
)
+
dim
(
S
′
′
)
−
dim
(
S
′
∪
S
′
′
)
=
n
−
i
+
1
+
n
−
j
+
1
−
dim
(
S
′
∪
S
′
′
)
≥
n
−
i
+
1
+
n
−
j
+
1
−
n
=
n
−
i
+
1
−
j
+
1
≥
1
\begin{aligned} v&=\operatorname{dim}\left(\mathbf{S}'\cap\mathbf{S}''\right)\\ &=\operatorname{dim}\left(\mathbf{S}'\right)+\operatorname{dim}\left(\mathbf{S}''\right)-\operatorname{dim}\left(\mathbf{S}'\cup\mathbf{S}''\right)\\ &=n-i+1+n-j+1-\operatorname{dim}\left(\mathbf{S}'\cup\mathbf{S}''\right)\\ &\ge n-i+1+n-j+1-n\\ &=n-i+1-j+1\\ &\ge1 \end{aligned}
v=dim(S′∩S′′)=dim(S′)+dim(S′′)−dim(S′∪S′′)=n−i+1+n−j+1−dim(S′∪S′′)≥n−i+1+n−j+1−n=n−i+1−j+1≥1
利用Min-max theorem
n
−
i
+
1
+
n
−
j
+
1
−
v
=
dim
(
S
′
∪
S
′
′
)
<
=
n
⇒
n
−
v
+
1
≤
i
+
j
−
1
n-i+1+n-j+1-v=\operatorname{dim}\left(\mathbf{S}'\cup\mathbf{S}''\right)<=n\Rightarrow n-v+1\le i+j-1
n−i+1+n−j+1−v=dim(S′∪S′′)<=n⇒n−v+1≤i+j−1
所以
σ
i
+
j
−
1
(
A
+
B
)
≤
σ
n
−
v
+
1
(
A
+
B
)
=
min
S
∈
C
dim
(
S
)
=
v
max
x
∈
S
∥
x
∥
=
1
∥
(
A
+
B
)
x
∥
≤
max
x
∈
S
′
∩
S
′
′
∥
x
∥
=
1
∥
(
A
+
B
)
x
∥
≤
max
x
∈
S
′
∩
S
′
′
∥
x
∥
=
1
∥
A
x
∥
+
max
x
∈
S
′
∩
S
′
′
∥
x
∥
=
1
∥
B
x
∥
≤
max
x
∈
S
′
∥
x
∥
=
1
∥
A
x
∥
+
max
x
∈
S
′
′
∥
x
∥
=
1
∥
B
x
∥
=
σ
i
(
A
)
+
σ
j
(
B
)
\begin{aligned} \sigma_{i+j-1}\left(\mathbf{A}+\mathbf{B}\right)&\le\sigma_{n-v+1}\left(\mathbf{A}+\mathbf{B}\right)\\ &=\min_{\mathbf{S}\in\mathbb{C}\atop \operatorname{dim}\left(\mathbf{S}\right)=v}\max_{\mathbf{x}\in\mathbf{S}\atop\|\mathbf{x}\|=1}\|\left(\mathbf{A}+\mathbf{B}\right)\mathbf{x}\|\\ &\le\max_{\mathbf{x}\in\mathbf{S}'\cap \mathbf{S}''\atop\|\mathbf{x}\|=1}\|\left(\mathbf{A}+\mathbf{B}\right)\mathbf{x}\|\\ &\le\max_{\mathbf{x}\in\mathbf{S}'\cap \mathbf{S}''\atop\|\mathbf{x}\|=1}\|\mathbf{A}\mathbf{x}\|+\max_{\mathbf{x}\in\mathbf{S}'\cap \mathbf{S}''\atop\|\mathbf{x}\|=1}\|\mathbf{B}\mathbf{x}\|\\ &\le\max_{\mathbf{x}\in\mathbf{S}'\atop\|\mathbf{x}\|=1}\|\mathbf{A}\mathbf{x}\|+\max_{\mathbf{x}\in\mathbf{S}''\atop\|\mathbf{x}\|=1}\|\mathbf{B}\mathbf{x}\|\\ &=\sigma_i\left(\mathbf{A}\right)+\sigma_j\left(\mathbf{B}\right) \end{aligned}
σi+j−1(A+B)≤σn−v+1(A+B)=dim(S)=vS∈Cmin∥x∥=1x∈Smax∥(A+B)x∥≤∥x∥=1x∈S′∩S′′max∥(A+B)x∥≤∥x∥=1x∈S′∩S′′max∥Ax∥+∥x∥=1x∈S′∩S′′max∥Bx∥≤∥x∥=1x∈S′max∥Ax∥+∥x∥=1x∈S′′max∥Bx∥=σi(A)+σj(B)
Eckart-Young Theorem
设矩阵
A
\mathbf{A}
A有SVD分解
A
=
U
Σ
V
T
\mathbf{A}=\mathbf{U}\mathbf{\Sigma}\mathbf{V}^T
A=UΣVT,其中
U
,
V
\mathbf{U},\mathbf{V}
U,V为正交矩阵
设
k
<
r
=
rank
(
A
)
k<r=\operatorname{rank}\left(\mathbf{A}\right)
k<r=rank(A)
A
k
=
∑
i
=
1
k
σ
i
u
i
v
i
T
\mathbf{A}_k=\sum_{i=1}^{k}\sigma_i\mathbf{u}_i\mathbf{v}_i^T
Ak=∑i=1kσiuiviT
其中
σ
i
\sigma_i
σi为
A
\mathbf{A}
A的奇异值,设
A
\mathbf{A}
A有
p
p
p个奇异值
σ
1
≥
σ
2
≥
⋯
≥
σ
r
>
σ
r
+
1
=
⋯
=
σ
p
=
0
\sigma_1\ge \sigma_2\ge \cdots\ge \sigma_r>\sigma_{r+1}=\cdots=\sigma_p=0
σ1≥σ2≥⋯≥σr>σr+1=⋯=σp=0
则
min
rank
(
B
)
=
k
∥
A
−
B
∥
2
=
∥
A
−
A
k
∥
2
=
σ
k
+
1
\min_{\operatorname{rank}\left(\mathbf{B}\right)=k}\|\mathbf{A}-\mathbf{B}\|_2=\|\mathbf{A}-\mathbf{A}_k\|_2=\sigma_{k+1}
rank(B)=kmin∥A−B∥2=∥A−Ak∥2=σk+1
min
rank
(
B
)
=
k
∥
A
−
B
∥
F
=
∥
A
−
A
k
∥
F
=
∑
i
=
k
+
1
p
σ
i
2
\min_{\operatorname{rank}\left(\mathbf{B}\right)=k}\|\mathbf{A}-\mathbf{B}\|_F=\|\mathbf{A}-\mathbf{A}_k\|_F=\sqrt{\sum_{i=k+1}^{p}\sigma_i^2}
rank(B)=kmin∥A−B∥F=∥A−Ak∥F=i=k+1∑pσi2
证明
二范数形式
A
=
∑
i
=
1
p
σ
i
u
i
v
i
T
\mathbf{A}=\sum_{i=1}^{p}\sigma_i\mathbf{u}_i\mathbf{v}_i^T
A=∑i=1pσiuiviT
A
k
=
∑
i
=
1
k
σ
i
u
i
v
i
T
\mathbf{A}_k=\sum_{i=1}^{k}\sigma_i\mathbf{u}_i\mathbf{v}_i^T
Ak=∑i=1kσiuiviT
A
−
A
k
\mathbf{A}-\mathbf{A}_k
A−Ak的前
k
k
k个奇异值为0,剩下的
p
−
k
p-k
p−k个奇异值为
σ
k
+
1
,
⋯
,
σ
p
\sigma_{k+1},\cdots,\sigma_{p}
σk+1,⋯,σp
因为
∥
A
∥
2
=
σ
1
=
σ
m
a
x
\|\mathbf{A}\|_2=\sigma_1=\sigma_{max}
∥A∥2=σ1=σmax
于是
∥
A
−
A
k
+
1
∥
2
=
σ
k
+
1
\|\mathbf{A}-\mathbf{A}_{k+1}\|_2=\sigma_{k+1}
∥A−Ak+1∥2=σk+1
接着证明其他的解>=最优解
因为
rank
(
B
)
=
k
\operatorname{rank}\left(\mathbf{B}\right)=k
rank(B)=k
dim
N
(
B
)
=
n
−
k
\operatorname{dim}N\left(\mathbf{B}\right)=n-k
dimN(B)=n−k
于是存在
n
−
k
n-k
n−k个标准正交向量
x
1
,
⋯
,
x
n
−
k
\mathbf{x}_1,\cdots,\mathbf{x}_{n-k}
x1,⋯,xn−k,使得
N
(
B
)
=
s
p
a
n
{
x
1
,
⋯
,
x
n
−
k
}
N\left(\mathbf{B}\right)=span\left\{\mathbf{x}_1,\cdots,\mathbf{x}_{n-k}\right\}
N(B)=span{x1,⋯,xn−k}
又因为
dim
s
p
a
n
{
x
1
,
⋯
,
x
n
−
k
}
+
dim
s
p
a
n
{
v
1
,
⋯
,
v
k
+
1
}
=
n
+
1
>
n
\operatorname{dim}span\left\{\mathbf{x}_1,\cdots,\mathbf{x}_{n-k}\right\}+\operatorname{dim}span\left\{\mathbf{v}_1,\cdots,\mathbf{v}_{k+1}\right\}=n+1>n
dimspan{x1,⋯,xn−k}+dimspan{v1,⋯,vk+1}=n+1>n
所以
s
p
a
n
{
x
1
,
⋯
,
x
n
−
k
}
∩
s
p
a
n
{
v
1
,
⋯
,
v
k
+
1
}
≠
{
0
}
span\left\{\mathbf{x}_1,\cdots,\mathbf{x}_{n-k}\right\}\cap span\left\{\mathbf{v}_1,\cdots,\mathbf{v}_{k+1}\right\}\neq\left\{0\right\}
span{x1,⋯,xn−k}∩span{v1,⋯,vk+1}={0}
存在
z
(
∥
z
∥
=
1
)
\mathbf{z}\left(\|\mathbf{z}\|=1\right)
z(∥z∥=1),使得
B
z
=
0
\mathbf{Bz}=0
Bz=0且
z
∈
s
p
a
n
{
v
1
,
⋯
,
v
k
+
1
}
\mathbf{z}\in span\left\{\mathbf{v}_1,\cdots,\mathbf{v}_{k+1}\right\}
z∈span{v1,⋯,vk+1}
z
=
∑
i
=
1
k
+
1
k
i
v
i
\mathbf{z}=\sum_{i=1}^{k+1}k_i\mathbf{v}_i
z=∑i=1k+1kivi并且
∑
i
=
1
k
+
1
k
i
2
=
1
\sum_{i=1}^{k+1}k_i^2=1
∑i=1k+1ki2=1
∥ A − B ∥ 2 = ∥ A − B ∥ 2 ∥ z ∥ ≥ ∥ ( A − B ) z ∥ = ∥ A z ∥ = ∥ U Σ V T z ∥ = ∥ Σ V T z ∥ = ∑ i = 1 p ( σ i v i T z ) 2 = ∑ i = 1 k + 1 ( σ i v i T z ) 2 ≥ σ k + 1 ∑ i = 1 k + 1 ( v i T z ) 2 = σ k + 1 ∑ i = 1 k + 1 k i 2 = σ k + 1 \begin{aligned} &\|\mathbf{A}-\mathbf{B}\|_2\\ =&\|\mathbf{A}-\mathbf{B}\|_2\|\mathbf{z}\|\\ \ge&\|\left(\mathbf{A}-\mathbf{B}\right)\mathbf{z}\|\\ =&\|\mathbf{Az}\|\\ =&\|\mathbf{U}\mathbf{\Sigma}\mathbf{V}^T\mathbf{z}\|\\ =&\|\mathbf{\Sigma}\mathbf{V}^T\mathbf{z}\|\\ =&\sum_{i=1}^{p}\left(\sigma_i\mathbf{v}_i^T\mathbf{z}\right)^2\\ =&\sum_{i=1}^{k+1}\left(\sigma_i\mathbf{v}_i^T\mathbf{z}\right)^2\\ \ge&\sigma_{k+1}\sum_{i=1}^{k+1}\left(\mathbf{v}_i^T\mathbf{z}\right)^2\\ =&\sigma_{k+1}\sum_{i=1}^{k+1}k_i^2\\ =&\sigma_{k+1} \end{aligned} =≥=====≥==∥A−B∥2∥A−B∥2∥z∥∥(A−B)z∥∥Az∥∥UΣVTz∥∥ΣVTz∥i=1∑p(σiviTz)2i=1∑k+1(σiviTz)2σk+1i=1∑k+1(viTz)2σk+1i=1∑k+1ki2σk+1
F范数形式
利用引理2
∥
A
−
A
k
∥
F
2
=
∑
i
=
k
+
1
r
σ
i
(
A
)
=
∑
i
=
k
+
1
r
σ
i
(
A
−
B
+
B
)
≤
∑
i
=
k
+
1
r
(
σ
i
−
k
(
A
−
B
)
+
σ
k
+
1
(
B
)
)
=
∑
i
=
k
+
1
r
σ
i
−
k
(
A
−
B
)
≤
∑
i
=
1
r
−
k
σ
i
(
A
−
B
)
≤
∥
A
−
B
∥
F
2
\begin{aligned} &\|\mathbf{A}-\mathbf{A}_k\|_F^2\\ =&\sum_{i=k+1}^r\sigma_i\left(\mathbf{A}\right)\\ =&\sum_{i=k+1}^r\sigma_i\left(\mathbf{A}-\mathbf{B}+\mathbf{B}\right)\\ \le&\sum_{i=k+1}^r\left(\sigma_{i-k}\left(\mathbf{A}-\mathbf{B}\right)+\sigma_{k+1}\left(\mathbf{B}\right)\right)\\ =&\sum_{i=k+1}^r\sigma_{i-k}\left(\mathbf{A}-\mathbf{B}\right)\\ \le&\sum_{i=1}^{r-k}\sigma_{i}\left(\mathbf{A}-\mathbf{B}\right)\\ \le&\|\mathbf{A}-\mathbf{B}\|_F^2 \end{aligned}
==≤=≤≤∥A−Ak∥F2i=k+1∑rσi(A)i=k+1∑rσi(A−B+B)i=k+1∑r(σi−k(A−B)+σk+1(B))i=k+1∑rσi−k(A−B)i=1∑r−kσi(A−B)∥A−B∥F2
所以成立