Construction Tricks in the Block Coordinate Descent Algorithm

Purpose of the Construction

By introducing auxiliary variables, the original non-convex problem is turned into subproblems that are convex in each block of variables; the algorithm then optimizes the blocks (including the auxiliary variables) in an alternating fashion.

The Theorem (Lemma 4.1)

Define an $m \times m$ matrix function
$$\mathbf{E}(\mathbf{U}, \mathbf{V}) \triangleq \left(\mathbf{I}-\mathbf{U}^{H}\mathbf{H}\mathbf{V}\right)\left(\mathbf{I}-\mathbf{U}^{H}\mathbf{H}\mathbf{V}\right)^{H}+\mathbf{U}^{H}\mathbf{N}\mathbf{U},$$
where $\mathbf{N}$ is any positive definite matrix. The following three facts hold true.

1) For any positive definite matrix $\mathbf{E} \in \mathbb{C}^{m \times m}$, we have
$$\mathbf{E}^{-1}=\arg\max_{\mathbf{W} \succ \mathbf{0}} \log\det(\mathbf{W})-\operatorname{Tr}(\mathbf{W}\mathbf{E})$$
(Note: $\arg\max$ denotes the argument at which the function attains its maximum.)
and
$$-\log\det(\mathbf{E})=\max_{\mathbf{W} \succ \mathbf{0}} \log\det(\mathbf{W})-\operatorname{Tr}(\mathbf{W}\mathbf{E})+m.$$

2) For any positive definite matrix $\mathbf{W}$, we have
$$\tilde{\mathbf{U}} \triangleq \arg\min_{\mathbf{U}} \operatorname{Tr}(\mathbf{W}\mathbf{E}(\mathbf{U}, \mathbf{V})) = \left(\mathbf{N}+\mathbf{H}\mathbf{V}\mathbf{V}^{H}\mathbf{H}^{H}\right)^{-1}\mathbf{H}\mathbf{V}$$
and
$$\mathbf{E}(\tilde{\mathbf{U}}, \mathbf{V}) = \mathbf{I}-\tilde{\mathbf{U}}^{H}\mathbf{H}\mathbf{V} = \left(\mathbf{I}+\mathbf{V}^{H}\mathbf{H}^{H}\mathbf{N}^{-1}\mathbf{H}\mathbf{V}\right)^{-1}.$$

3) We have
$$\log\det\left(\mathbf{I}+\mathbf{H}\mathbf{V}\mathbf{V}^{H}\mathbf{H}^{H}\mathbf{N}^{-1}\right) = \max_{\mathbf{W} \succ \mathbf{0},\, \mathbf{U}} \log\det(\mathbf{W})-\operatorname{Tr}(\mathbf{W}\mathbf{E}(\mathbf{U}, \mathbf{V}))+m.$$

Facts 1) and 2) can be proven by simply using the first-order optimality conditions, while Fact 3) directly follows from Facts 1) and 2) and the identity $\log\det(\mathbf{I}+\mathbf{A}\mathbf{B})=\log\det(\mathbf{I}+\mathbf{B}\mathbf{A})$. We refer readers to [32], [33] for a more detailed proof.
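As a sanity check, the three facts can be verified numerically. The following NumPy sketch (the dimension choices `nr`, `nt`, `m` and the random matrices are illustrative assumptions, not taken from the source) confirms Facts 1)–3) for a random instance:

```python
import numpy as np

rng = np.random.default_rng(0)

# Dimensions: H is (nr x nt), V is (nt x m), U is (nr x m); N is (nr x nr) and
# positive definite. These sizes are arbitrary illustrative choices.
nr, nt, m = 4, 5, 3
H = rng.standard_normal((nr, nt)) + 1j * rng.standard_normal((nr, nt))
V = rng.standard_normal((nt, m)) + 1j * rng.standard_normal((nt, m))
A = rng.standard_normal((nr, nr)) + 1j * rng.standard_normal((nr, nr))
N = A @ A.conj().T + np.eye(nr)                      # positive definite N

def E_mat(U, V):
    """E(U, V) = (I - U^H H V)(I - U^H H V)^H + U^H N U."""
    T = np.eye(m) - U.conj().T @ H @ V
    return T @ T.conj().T + U.conj().T @ N @ U

def logdet(M):
    return np.linalg.slogdet(M)[1]

# Fact 2): the minimizing U (an MMSE receiver) and the resulting MSE matrix.
U_t = np.linalg.solve(N + H @ V @ V.conj().T @ H.conj().T, H @ V)
E_t = E_mat(U_t, V)
E_closed = np.linalg.inv(np.eye(m) + V.conj().T @ H.conj().T
                         @ np.linalg.solve(N, H @ V))
print(np.allclose(E_t, E_closed))                    # True

# Fact 1) applied to E_t: the maximizing weight is W = E_t^{-1}.
W_opt = np.linalg.inv(E_t)
print(np.isclose(-logdet(E_t),
                 logdet(W_opt) - np.trace(W_opt @ E_t).real + m))        # True

# Fact 3): the log-det rate equals the maximized surrogate.
rate = logdet(np.eye(nr) + H @ V @ V.conj().T @ H.conj().T @ np.linalg.inv(N))
print(np.isclose(rate, logdet(W_opt) - np.trace(W_opt @ E_t).real + m))  # True
```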

Next, using Lemma 4.1, we derive an equivalent problem of problem (5) by introducing some auxiliary variables. Define
$$\mathbb{E}(\mathbf{U}, \mathbf{V}) \triangleq \left(\mathbf{I}-\mathbf{U}^{H}\mathbf{H}_{I}\mathbf{V}\right)\left(\mathbf{I}-\mathbf{U}^{H}\mathbf{H}_{I}\mathbf{V}\right)^{H}+\mathbf{U}^{H}\mathbf{U}.$$

Then we have from Fact 3) that

$$\log\det\left(\mathbf{I}+\mathbf{H}_{I}\mathbf{V}\mathbf{V}^{H}\mathbf{H}_{I}^{H}\right) = \max_{\mathbf{W}_{I} \succ \mathbf{0},\, \mathbf{U}} \log\det\left(\mathbf{W}_{I}\right)-\operatorname{Tr}\left(\mathbf{W}_{I}\,\mathbb{E}(\mathbf{U}, \mathbf{V})\right)+d.$$

Furthermore, from Fact 1), we have

$$-\log\det\left(\mathbf{I}+\mathbf{H}_{E}\mathbf{V}\mathbf{V}^{H}\mathbf{H}_{E}^{H}\right) = \max_{\mathbf{W}_{E} \succ \mathbf{0}} \log\det\left(\mathbf{W}_{E}\right)-\operatorname{Tr}\left(\mathbf{W}_{E}\left(\mathbf{I}+\mathbf{H}_{E}\mathbf{V}\mathbf{V}^{H}\mathbf{H}_{E}^{H}\right)\right)+N_{E}.$$
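Putting the two reformulations together (and assuming problem (5) maximizes the secrecy-rate objective $\log\det(\mathbf{I}+\mathbf{H}_{I}\mathbf{V}\mathbf{V}^{H}\mathbf{H}_{I}^{H})-\log\det(\mathbf{I}+\mathbf{H}_{E}\mathbf{V}\mathbf{V}^{H}\mathbf{H}_{E}^{H})$ over $\mathbf{V}$; its constraints are not reproduced here), the equivalent problem obtained from Lemma 4.1 would take the form
$$\max_{\mathbf{V},\,\mathbf{U},\,\mathbf{W}_{I} \succ \mathbf{0},\,\mathbf{W}_{E} \succ \mathbf{0}} \; \log\det(\mathbf{W}_{I})-\operatorname{Tr}\left(\mathbf{W}_{I}\,\mathbb{E}(\mathbf{U},\mathbf{V})\right)+\log\det(\mathbf{W}_{E})-\operatorname{Tr}\left(\mathbf{W}_{E}\left(\mathbf{I}+\mathbf{H}_{E}\mathbf{V}\mathbf{V}^{H}\mathbf{H}_{E}^{H}\right)\right)+d+N_{E},$$
which is concave in each block ($\mathbf{V}$, $\mathbf{U}$, $\mathbf{W}_{I}$, $\mathbf{W}_{E}$) when the other blocks are fixed, so block coordinate descent can cycle through the blocks, each update being a convex problem (closed-form for $\mathbf{U}$, $\mathbf{W}_{I}$, and $\mathbf{W}_{E}$ by Facts 1) and 2)).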


Statement of the Theorem in Another Paper

Physical Layer Security in Near-Field Communications

[Figures: screenshots of the corresponding lemma as stated in the paper above.]

Relationship Between This Construction and the WMMSE Algorithm

Consider a fully digital beamforming optimization problem:
$$\begin{aligned} \max_{\mathbf{W}_{\mathrm{FD}}} \quad & \log_{2}\left|\mathbf{I}_{M_{\mathrm{B}}}+\sigma_{\mathrm{B}}^{-2}\mathbf{H}_{\mathrm{B}}\mathbf{W}_{\mathrm{FD}}\mathbf{W}_{\mathrm{FD}}^{\mathrm{H}}\mathbf{H}_{\mathrm{B}}^{\mathrm{H}}\right| \\ \text{s.t.} \quad & \left\|\mathbf{W}_{\mathrm{FD}}\right\|_{\mathrm{F}}^{2} \leqslant P_{\max}, \\ & \left\|\mathbf{H}_{\mathrm{W}}\mathbf{W}_{\mathrm{FD}}\right\|_{\mathrm{F}}^{2} \leqslant p_{\text{leak}}. \end{aligned} \tag{26}$$
A WMMSE-based algorithm is designed to solve this subproblem. Its main idea is to exploit the equivalence between the rate-maximization problem and the mean-square-error (MSE) minimization problem, so that the original problem is transformed into a more tractable form [38].
Specifically, the signal vector $\widetilde{\mathbf{s}}$ at Bob is estimated by an introduced linear receive beamforming matrix $\mathbf{U} \in \mathbb{C}^{M_{\mathrm{B}} \times L}$ as $\widetilde{\mathbf{s}}=\mathbf{U}^{\mathrm{H}}\mathbf{y}_{\mathrm{B}}$. Then, the MSE matrix at Bob can be written as

$$\begin{aligned} \mathbf{E} &= \mathbb{E}_{\mathbf{s},\mathbf{n}_{\mathrm{B}}}\left[(\widetilde{\mathbf{s}}-\mathbf{s})(\widetilde{\mathbf{s}}-\mathbf{s})^{\mathrm{H}}\right] \\ &= \left(\mathbf{I}_{L}-\mathbf{U}^{\mathrm{H}}\mathbf{H}_{\mathrm{B}}\mathbf{W}_{\mathrm{FD}}\right)\left(\mathbf{I}_{L}-\mathbf{U}^{\mathrm{H}}\mathbf{H}_{\mathrm{B}}\mathbf{W}_{\mathrm{FD}}\right)^{\mathrm{H}}+\sigma_{\mathrm{B}}^{2}\mathbf{U}^{\mathrm{H}}\mathbf{U}. \end{aligned} \tag{27}$$

By introducing a weight matrix $\boldsymbol{\Psi} \succcurlyeq \mathbf{0}$ for Bob, the subproblem (26) can be equivalently reformulated as [38, Thm. 1]

$$\begin{aligned} \min_{\boldsymbol{\Psi}, \mathbf{U}, \mathbf{W}_{\mathrm{FD}}} \quad & \operatorname{Tr}(\boldsymbol{\Psi}\mathbf{E})-\log_{2}|\boldsymbol{\Psi}| \\ \text{s.t.} \quad & (26\mathrm{b}),\ (26\mathrm{c}). \end{aligned} \tag{28}$$

Although the transformed problem has more optimization variables than (26), the objective function in (28) is more tractable. The receive beamforming matrix $\mathbf{U}$ and the weight matrix $\boldsymbol{\Psi}$ only appear in the objective function (28a). By setting the derivatives of (28a) with respect to $\mathbf{U}$ and $\boldsymbol{\Psi}$ to zero, respectively, the optimal solutions can be obtained as

$$\begin{aligned} \mathbf{U}^{\star} &= \left(\mathbf{H}_{\mathrm{B}}\mathbf{W}_{\mathrm{FD}}\mathbf{W}_{\mathrm{FD}}^{\mathrm{H}}\mathbf{H}_{\mathrm{B}}^{\mathrm{H}}+\sigma_{\mathrm{B}}^{2}\mathbf{I}_{M_{\mathrm{B}}}\right)^{-1}\mathbf{H}_{\mathrm{B}}\mathbf{W}_{\mathrm{FD}}, \\ \boldsymbol{\Psi}^{\star} &= \mathbf{E}^{-1}. \end{aligned} \tag{29}$$

Substituting the optimal $\mathbf{U}^{\star}$ in (29) into (27) yields the optimal MSE matrix as follows:

$$\mathbf{E}^{\star} = \mathbf{I}_{L}-\mathbf{W}_{\mathrm{FD}}^{\mathrm{H}}\mathbf{H}_{\mathrm{B}}^{\mathrm{H}}\left(\mathbf{H}_{\mathrm{B}}\mathbf{W}_{\mathrm{FD}}\mathbf{W}_{\mathrm{FD}}^{\mathrm{H}}\mathbf{H}_{\mathrm{B}}^{\mathrm{H}}+\sigma_{\mathrm{B}}^{2}\mathbf{I}_{M_{\mathrm{B}}}\right)^{-1}\mathbf{H}_{\mathrm{B}}\mathbf{W}_{\mathrm{FD}}. \tag{30}$$
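As a quick numerical cross-check (a sketch with hypothetical dimensions $M_{\mathrm{T}}$, $M_{\mathrm{B}}$, $L$ and random channels, none of which come from the source), one can verify that plugging $\mathbf{U}^{\star}$ of (29) into (27) indeed yields (30), and that $-\log_{2}|\mathbf{E}^{\star}|$ recovers the rate objective of (26):

```python
import numpy as np

rng = np.random.default_rng(1)

# Hypothetical sizes (not from the source): M_T transmit antennas, M_B receive
# antennas at Bob, L data streams, noise power sigma_B^2.
M_T, M_B, L, sigma_B = 8, 4, 3, 0.5
H_B = rng.standard_normal((M_B, M_T)) + 1j * rng.standard_normal((M_B, M_T))
W_FD = rng.standard_normal((M_T, L)) + 1j * rng.standard_normal((M_T, L))

HW = H_B @ W_FD
Cov = HW @ HW.conj().T + sigma_B**2 * np.eye(M_B)    # receive covariance at Bob

# (29): MMSE receiver U*.
U_star = np.linalg.solve(Cov, HW)

# (27) evaluated at U*.
T = np.eye(L) - U_star.conj().T @ HW
E_27 = T @ T.conj().T + sigma_B**2 * U_star.conj().T @ U_star

# (30): closed-form optimal MSE matrix.
E_30 = np.eye(L) - HW.conj().T @ np.linalg.solve(Cov, HW)
print(np.allclose(E_27, E_30))                        # True

# Rate equivalence: -log2|E*| equals the objective of (26).
rate_26 = np.linalg.slogdet(np.eye(M_B) + HW @ HW.conj().T / sigma_B**2)[1] / np.log(2)
print(np.isclose(-np.linalg.slogdet(E_30)[1] / np.log(2), rate_26))       # True
```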

Substituting (27) into the objective function of (28) and discarding the constant terms, the problem that updates the full-digital beamforming matrix $\mathbf{W}_{\mathrm{FD}}$ is transformed into

$$\begin{aligned} \min_{\mathbf{W}_{\mathrm{FD}}} \quad & \operatorname{Tr}\left(\mathbf{W}_{\mathrm{FD}}^{\mathrm{H}}\mathbf{H}_{\mathrm{B}}^{\mathrm{H}}\mathbf{U}\boldsymbol{\Psi}\mathbf{U}^{\mathrm{H}}\mathbf{H}_{\mathrm{B}}\mathbf{W}_{\mathrm{FD}}\right)-\operatorname{Tr}\left(\boldsymbol{\Psi}\mathbf{W}_{\mathrm{FD}}^{\mathrm{H}}\mathbf{H}_{\mathrm{B}}^{\mathrm{H}}\mathbf{U}\right)-\operatorname{Tr}\left(\boldsymbol{\Psi}\mathbf{U}^{\mathrm{H}}\mathbf{H}_{\mathrm{B}}\mathbf{W}_{\mathrm{FD}}\right) \\ \text{s.t.} \quad & (26\mathrm{b}),\ (26\mathrm{c}). \end{aligned}$$
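To make the alternating structure concrete, here is a minimal sketch of the $\mathbf{W}_{\mathrm{FD}}$ update, assuming only the power constraint (26b) is kept (the leakage constraint (26c) is dropped for brevity) and using the standard Lagrangian bisection; the helper name `update_W_FD` and all numerical settings are illustrative assumptions, not from the source:

```python
import numpy as np

def update_W_FD(H_B, U, Psi, P_max, tol=1e-8):
    """One W_FD step of the BCD pass: minimize
        Tr(W^H A W) - 2 Re{Tr(Psi U^H H_B W)}   s.t.  ||W||_F^2 <= P_max,
    where A = H_B^H U Psi U^H H_B, by bisection on the Lagrange multiplier mu.
    The leakage constraint (26c) is omitted in this sketch."""
    M_T = H_B.shape[1]
    A = H_B.conj().T @ U @ Psi @ U.conj().T @ H_B
    B = H_B.conj().T @ U @ Psi                   # right-hand side of the KKT system

    def W_of(mu):
        return np.linalg.solve(A + mu * np.eye(M_T), B)

    # If the unconstrained (pseudo-inverse) solution already meets the power
    # budget, the constraint is inactive and mu = 0.
    W0 = np.linalg.pinv(A) @ B
    if np.linalg.norm(W0, 'fro') ** 2 <= P_max:
        return W0

    lo, hi = 0.0, 1.0
    while np.linalg.norm(W_of(hi), 'fro') ** 2 > P_max:   # bracket the multiplier
        hi *= 2.0
    while hi - lo > tol:                                  # ||W(mu)||_F^2 decreases in mu
        mid = 0.5 * (lo + hi)
        if np.linalg.norm(W_of(mid), 'fro') ** 2 > P_max:
            lo = mid
        else:
            hi = mid
    return W_of(hi)
```

A full BCD pass would then alternate: with $\mathbf{W}_{\mathrm{FD}}$ fixed, update $\mathbf{U}$ and $\boldsymbol{\Psi}$ via (29); with $\mathbf{U}$ and $\boldsymbol{\Psi}$ fixed, update $\mathbf{W}_{\mathrm{FD}}$ as above; and repeat until the objective of (28) stops decreasing.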

Comparing the WMMSE and BCD Constructions

The WMMSE-transformed expression (27) is essentially the matrix function $\mathbf{E}(\mathbf{U}, \mathbf{V})$ defined above: setting $\mathbf{H}=\mathbf{H}_{\mathrm{B}}$, $\mathbf{V}=\mathbf{W}_{\mathrm{FD}}$, and $\mathbf{N}=\sigma_{\mathrm{B}}^{2}\mathbf{I}_{M_{\mathrm{B}}}$ in the lemma recovers (27) exactly, so the BCD construction and the WMMSE reformulation are two views of the same trick.

Source

https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7018097 lemma 4.1
