This article is a set of reading notes on *Linear algebra and its applications*.
Orthogonal projections
- Given a vector $\boldsymbol y$ and a subspace $W$ in $\mathbb R^n$, there is a vector $\hat{\boldsymbol y}$ in $W$ such that
  - (1) $\hat{\boldsymbol y}$ is the unique vector in $W$ for which $\boldsymbol y - \hat{\boldsymbol y}$ is orthogonal to $W$
  - (2) $\hat{\boldsymbol y}$ is the unique vector in $W$ closest to $\boldsymbol y$
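A minimal numerical sketch of these two properties (assuming NumPy; the orthogonal basis $\{\boldsymbol u_1, \boldsymbol u_2\}$ and the vector $\boldsymbol y$ below are illustrative choices):

```python
import numpy as np

# Illustrative orthogonal (not orthonormal) basis for a plane W in R^3.
u1 = np.array([2.0, 5.0, -1.0])
u2 = np.array([-2.0, 1.0, 1.0])   # u1 . u2 = -4 + 5 - 1 = 0
y = np.array([1.0, 2.0, 3.0])

# Project y onto each basis vector and add the results
# (this is formula (2) of the decomposition theorem below).
y_hat = (y @ u1) / (u1 @ u1) * u1 + (y @ u2) / (u2 @ u2) * u2
z = y - y_hat                     # the component of y in W_perp

print(y_hat)                      # [-0.4  2.   0.2]
print(z @ u1, z @ u2)             # both ~ 0: y - y_hat is orthogonal to W
```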
EXAMPLE 1
Let $\{\boldsymbol u_1,...,\boldsymbol u_5\}$ be an orthogonal basis for $\mathbb R^5$ and let
$$\boldsymbol y = c_1\boldsymbol u_1 + c_2\boldsymbol u_2 + c_3\boldsymbol u_3 + c_4\boldsymbol u_4 + c_5\boldsymbol u_5$$
Consider the subspace $W = Span\{\boldsymbol u_1,\boldsymbol u_2\}$, and write $\boldsymbol y$ as the sum of a vector $\boldsymbol z_1$ in $W$ and a vector $\boldsymbol z_2$ in $W^\perp$.
SOLUTION
- Write
$$\boldsymbol y = \underbrace{c_1\boldsymbol u_1 + c_2\boldsymbol u_2}_{\boldsymbol z_1} + \underbrace{c_3\boldsymbol u_3 + c_4\boldsymbol u_4 + c_5\boldsymbol u_5}_{\boldsymbol z_2}$$
Then $\boldsymbol z_1$ is in $W$, and $\boldsymbol z_2$ is in $W^\perp$, because $\boldsymbol z_2$ is a linear combination of $\boldsymbol u_3, \boldsymbol u_4, \boldsymbol u_5$, each of which is orthogonal to both $\boldsymbol u_1$ and $\boldsymbol u_2$, hence to every vector in $W$.
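A quick numerical check of this decomposition (a sketch assuming NumPy; a QR factorization is used to manufacture an orthonormal, hence orthogonal, basis of $\mathbb R^5$):

```python
import numpy as np

rng = np.random.default_rng(0)
# Columns of Q form an orthonormal (hence orthogonal) basis u1,...,u5 of R^5.
Q, _ = np.linalg.qr(rng.standard_normal((5, 5)))
u = [Q[:, j] for j in range(5)]

y = rng.standard_normal(5)
c = [y @ u[j] for j in range(5)]              # coordinates of y in this basis

z1 = c[0] * u[0] + c[1] * u[1]                # piece in W = Span{u1, u2}
z2 = c[2] * u[2] + c[3] * u[3] + c[4] * u[4]  # piece in W_perp

print(np.allclose(y, z1 + z2))                             # True
print(np.isclose(z2 @ u[0], 0), np.isclose(z2 @ u[1], 0))  # True True
```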
- The next theorem shows that the decomposition $\boldsymbol y = \boldsymbol z_1 + \boldsymbol z_2$ in Example 1 can be computed without having an orthogonal basis for $\mathbb R^n$. It is enough to have an orthogonal basis only for $W$.

THE ORTHOGONAL DECOMPOSITION THEOREM (THEOREM 8)
Let $W$ be a subspace of $\mathbb R^n$. Then each $\boldsymbol y$ in $\mathbb R^n$ can be written uniquely in the form
$$\boldsymbol y = \hat{\boldsymbol y} + \boldsymbol z\tag{1}$$
where $\hat{\boldsymbol y}$ is in $W$ and $\boldsymbol z$ is in $W^\perp$. In fact, if $\{\boldsymbol u_1,...,\boldsymbol u_p\}$ is any orthogonal basis of $W$, then
$$\hat{\boldsymbol y} = \frac{\boldsymbol y\cdot\boldsymbol u_1}{\boldsymbol u_1\cdot\boldsymbol u_1}\boldsymbol u_1 + \cdots + \frac{\boldsymbol y\cdot\boldsymbol u_p}{\boldsymbol u_p\cdot\boldsymbol u_p}\boldsymbol u_p\tag{2}$$
and $\boldsymbol z = \boldsymbol y - \hat{\boldsymbol y}$.
- The vector $\hat{\boldsymbol y}$ in (1) is called the orthogonal projection of $\boldsymbol y$ onto $W$ and often is written as $proj_W\boldsymbol y$. When $W$ is a one-dimensional subspace, the formula for $\hat{\boldsymbol y}$ matches the formula given in Section 6.2.
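For reference, when $W = Span\{\boldsymbol u\}$ with $\boldsymbol u \neq \boldsymbol 0$, formula (2) has a single term and reduces to the Section 6.2 formula:
$$\hat{\boldsymbol y} = proj_W\boldsymbol y = \frac{\boldsymbol y\cdot\boldsymbol u}{\boldsymbol u\cdot\boldsymbol u}\boldsymbol u$$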
PROOF
We may assume that $W$ is not the zero subspace, for otherwise $W^\perp = \mathbb R^n$ and (1) is simply $\boldsymbol y = \boldsymbol 0 + \boldsymbol y$. The next section will show that any nonzero subspace of $\mathbb R^n$ has an orthogonal basis.
- Let $\{\boldsymbol u_1,...,\boldsymbol u_p\}$ be any orthogonal basis for $W$, and define $\hat{\boldsymbol y}$ by (2). Let $\boldsymbol z = \boldsymbol y - \hat{\boldsymbol y}$. Since $\boldsymbol u_1$ is orthogonal to $\boldsymbol u_2,...,\boldsymbol u_p$,
$$\boldsymbol z\cdot\boldsymbol u_1 = (\boldsymbol y - \hat{\boldsymbol y})\cdot\boldsymbol u_1 = \boldsymbol y\cdot\boldsymbol u_1 - \frac{\boldsymbol y\cdot\boldsymbol u_1}{\boldsymbol u_1\cdot\boldsymbol u_1}(\boldsymbol u_1\cdot\boldsymbol u_1) - 0 - \cdots - 0 = 0$$
Thus $\boldsymbol z$ is orthogonal to $\boldsymbol u_1$. Similarly, $\boldsymbol z$ is orthogonal to each $\boldsymbol u_j$ in the basis for $W$. Hence $\boldsymbol z$ is orthogonal to every vector in $W$. That is, $\boldsymbol z$ is in $W^\perp$.
- To show that the decomposition in (1) is unique, suppose $\boldsymbol y$ can also be written as $\boldsymbol y = \hat{\boldsymbol y}_1 + \boldsymbol z_1$, with $\hat{\boldsymbol y}_1$ in $W$ and $\boldsymbol z_1$ in $W^\perp$. Then $\hat{\boldsymbol y} + \boldsymbol z = \hat{\boldsymbol y}_1 + \boldsymbol z_1$, and so
$$\hat{\boldsymbol y} - \hat{\boldsymbol y}_1 = \boldsymbol z_1 - \boldsymbol z$$
This equality shows that the vector $\boldsymbol v = \hat{\boldsymbol y} - \hat{\boldsymbol y}_1$ is in $W$ and in $W^\perp$. Hence $\boldsymbol v\cdot\boldsymbol v = 0$, which shows that $\boldsymbol v = \boldsymbol 0$. This proves that $\hat{\boldsymbol y} = \hat{\boldsymbol y}_1$ and also $\boldsymbol z_1 = \boldsymbol z$.
EXERCISE
Suppose that $\{\boldsymbol u_1, \boldsymbol u_2\}$ is an orthogonal set of nonzero vectors in $\mathbb R^3$. How would you find an orthogonal basis of $\mathbb R^3$ that contains $\boldsymbol u_1$ and $\boldsymbol u_2$?
SOLUTION
- First, find a vector $\boldsymbol v$ in $\mathbb R^3$ that is not in the subspace $W$ spanned by $\boldsymbol u_1$ and $\boldsymbol u_2$. Let $\boldsymbol u_3 = \boldsymbol v - proj_W\boldsymbol v$; then $\{\boldsymbol u_1, \boldsymbol u_2, \boldsymbol u_3\}$ is an orthogonal basis.
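A sketch of this construction (assuming NumPy; $\boldsymbol u_1$, $\boldsymbol u_2$, and $\boldsymbol v$ are illustrative choices):

```python
import numpy as np

u1 = np.array([1.0, 0.0, 1.0])
u2 = np.array([1.0, 0.0, -1.0])   # u1 . u2 = 0, so {u1, u2} is orthogonal

# W = Span{u1, u2} is the xz-plane, so any vector with a nonzero
# second entry lies outside W.
v = np.array([0.0, 1.0, 1.0])

proj_W_v = (v @ u1) / (u1 @ u1) * u1 + (v @ u2) / (u2 @ u2) * u2
u3 = v - proj_W_v

print(u3)                  # [0. 1. 0.]
print(u3 @ u1, u3 @ u2)    # 0.0 0.0 -> {u1, u2, u3} is an orthogonal basis
```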
EXERCISE 23
Let $A$ be an $m \times n$ matrix. Prove that every vector $\boldsymbol x$ in $\mathbb R^n$ can be written in the form $\boldsymbol x = \boldsymbol p + \boldsymbol u$, where $\boldsymbol p$ is in $Row\ A$ and $\boldsymbol u$ is in $Nul\ A$. Also, show that if the equation $A\boldsymbol x = \boldsymbol b$ is consistent, then there is a unique $\boldsymbol p$ in $Row\ A$ such that $A\boldsymbol p = \boldsymbol b$.
SOLUTION
- By the Orthogonal Decomposition Theorem, each $\boldsymbol x$ in $\mathbb R^n$ can be written uniquely as $\boldsymbol x = \boldsymbol p + \boldsymbol u$, with $\boldsymbol p$ in $Row\ A$ and $\boldsymbol u$ in $(Row\ A)^\perp = Nul\ A$.
- Next, suppose that $A\boldsymbol x = \boldsymbol b$ is consistent. Let $\boldsymbol x$ be a solution, and write $\boldsymbol x = \boldsymbol p + \boldsymbol u$ as above. Then $A\boldsymbol p = A(\boldsymbol x - \boldsymbol u) = A\boldsymbol x - A\boldsymbol u = \boldsymbol b - \boldsymbol 0 = \boldsymbol b$, so the equation $A\boldsymbol x = \boldsymbol b$ has at least one solution $\boldsymbol p$ in $Row\ A$.
- Finally, suppose that $\boldsymbol p$ and $\boldsymbol p_1$ are both in $Row\ A$ and satisfy $A\boldsymbol x = \boldsymbol b$. Then $\boldsymbol p - \boldsymbol p_1$ is in $Nul\ A$ because
$$A(\boldsymbol p - \boldsymbol p_1) = A\boldsymbol p - A\boldsymbol p_1 = \boldsymbol b - \boldsymbol b = \boldsymbol 0$$
The equations $\boldsymbol p = \boldsymbol p_1 + (\boldsymbol p - \boldsymbol p_1)$ and $\boldsymbol p = \boldsymbol p + \boldsymbol 0$ both decompose $\boldsymbol p$ as the sum of a vector in $Row\ A$ and a vector in $(Row\ A)^\perp$. By the uniqueness of the orthogonal decomposition (Theorem 8), $\boldsymbol p_1 = \boldsymbol p$, so $\boldsymbol p$ is unique.
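A numerical sketch of this decomposition (assuming NumPy; $A$ and $\boldsymbol x$ are random, and an orthonormal basis for $Row\ A$ is read off the SVD):

```python
import numpy as np

rng = np.random.default_rng(1)
A = rng.standard_normal((3, 5))   # m x n with m < n, so Nul A is nonzero
x = rng.standard_normal(5)

# Rows of Vt[:r] form an orthonormal basis for Row A (r = rank A).
_, s, Vt = np.linalg.svd(A)
r = int(np.sum(s > 1e-12))
Q = Vt[:r].T                      # n x r, orthonormal columns spanning Row A

p = Q @ Q.T @ x                   # projection of x onto Row A
u = x - p                         # lies in (Row A)_perp = Nul A

print(np.allclose(A @ u, 0))      # True: u is in Nul A
print(np.allclose(A @ p, A @ x))  # True: p satisfies A p = A x
```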
A Geometric Interpretation of the Orthogonal Projection
- When $W$ is a one-dimensional subspace, the formula (2) for $proj_W\boldsymbol y$ contains just one term. Thus, when $\dim W > 1$, each term in (2) is itself an orthogonal projection of $\boldsymbol y$ onto a one-dimensional subspace spanned by one of the $\boldsymbol u$'s in the basis for $W$. Figure 3 illustrates this when $W$ is a subspace of $\mathbb R^3$ spanned by $\boldsymbol u_1$ and $\boldsymbol u_2$.
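In the setting of Figure 3, with $L_1 = Span\{\boldsymbol u_1\}$ and $L_2 = Span\{\boldsymbol u_2\}$, formula (2) splits into two one-dimensional projections:
$$\hat{\boldsymbol y} = \frac{\boldsymbol y\cdot\boldsymbol u_1}{\boldsymbol u_1\cdot\boldsymbol u_1}\boldsymbol u_1 + \frac{\boldsymbol y\cdot\boldsymbol u_2}{\boldsymbol u_2\cdot\boldsymbol u_2}\boldsymbol u_2 = proj_{L_1}\boldsymbol y + proj_{L_2}\boldsymbol y$$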
Properties of Orthogonal Projections
- If $\boldsymbol y$ is in $W = Span\{\boldsymbol u_1,...,\boldsymbol u_p\}$, then $proj_W\boldsymbol y = \boldsymbol y$. This fact also follows from the next theorem.
The Best Approximation Theorem (Theorem 9)
Let $W$ be a subspace of $\mathbb R^n$, let $\boldsymbol y$ be any vector in $\mathbb R^n$, and let $\hat{\boldsymbol y}$ be the orthogonal projection of $\boldsymbol y$ onto $W$. Then $\hat{\boldsymbol y}$ is the closest point in $W$ to $\boldsymbol y$, in the sense that
$$\left\|\boldsymbol y - \hat{\boldsymbol y}\right\| < \left\|\boldsymbol y - \boldsymbol v\right\|\tag{3}$$
for all $\boldsymbol v$ in $W$ distinct from $\hat{\boldsymbol y}$.
- The vector $\hat{\boldsymbol y}$ in Theorem 9 is called the best approximation to $\boldsymbol y$ by elements of $W$.
- Later sections in the text will examine problems where a given $\boldsymbol y$ must be replaced, or approximated, by a vector $\boldsymbol v$ in some fixed subspace $W$. The distance $\left\|\boldsymbol y - \boldsymbol v\right\|$ can be regarded as the "error" of using $\boldsymbol v$ in place of $\boldsymbol y$. Theorem 9 says that this error is minimized when $\boldsymbol v = \hat{\boldsymbol y}$.
- Inequality (3) leads to a new proof that $\hat{\boldsymbol y}$ does not depend on the particular orthogonal basis used to compute it.
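A small numerical illustration of Theorem 9 (a sketch assuming NumPy, reusing illustrative vectors): no sampled point of $W$ gets closer to $\boldsymbol y$ than $\hat{\boldsymbol y}$.

```python
import numpy as np

rng = np.random.default_rng(2)
u1 = np.array([2.0, 5.0, -1.0])   # orthogonal basis for a plane W in R^3
u2 = np.array([-2.0, 1.0, 1.0])
y = np.array([1.0, 2.0, 3.0])

y_hat = (y @ u1) / (u1 @ u1) * u1 + (y @ u2) / (u2 @ u2) * u2
best = np.linalg.norm(y - y_hat)

# Sample many other points v = a*u1 + b*u2 of W and compare errors.
errs = [np.linalg.norm(y - (a * u1 + b * u2))
        for a, b in rng.standard_normal((1000, 2))]
print(best <= min(errs))          # True: no v in W beats y_hat
```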
PROOF
- Take $\boldsymbol v$ in $W$ distinct from $\hat{\boldsymbol y}$. See Figure 4. Then $\hat{\boldsymbol y} - \boldsymbol v$ is in $W$, so $\boldsymbol y - \hat{\boldsymbol y}$ is orthogonal to $\hat{\boldsymbol y} - \boldsymbol v$. Since
$$\boldsymbol y - \boldsymbol v = (\boldsymbol y - \hat{\boldsymbol y}) + (\hat{\boldsymbol y} - \boldsymbol v)$$
the Pythagorean Theorem gives
$$\left\|\boldsymbol y - \boldsymbol v\right\|^2 = \left\|\boldsymbol y - \hat{\boldsymbol y}\right\|^2 + \left\|\hat{\boldsymbol y} - \boldsymbol v\right\|^2$$
Now $\left\|\hat{\boldsymbol y} - \boldsymbol v\right\|^2 > 0$, and so inequality (3) follows immediately.
- The final theorem in this section shows how formula (2) for $proj_W\boldsymbol y$ is simplified when the basis for $W$ is an orthonormal set.

THEOREM 10
If $\{\boldsymbol u_1,...,\boldsymbol u_p\}$ is an orthonormal basis for a subspace $W$ of $\mathbb R^n$, then
$$proj_W\boldsymbol y = (\boldsymbol y\cdot\boldsymbol u_1)\boldsymbol u_1 + (\boldsymbol y\cdot\boldsymbol u_2)\boldsymbol u_2 + \cdots + (\boldsymbol y\cdot\boldsymbol u_p)\boldsymbol u_p$$
If $U = [\boldsymbol u_1\ \boldsymbol u_2\ \cdots\ \boldsymbol u_p]$, then
$$proj_W\boldsymbol y = UU^T\boldsymbol y\quad\text{for all }\boldsymbol y\text{ in }\mathbb R^n$$
- Suppose $U$ is an $n \times p$ matrix with orthonormal columns, and let $W$ be the column space of $U$. Then
$$U^TU\boldsymbol x = \boldsymbol x\quad\text{for all }\boldsymbol x\text{ in }\mathbb R^p$$
$$UU^T\boldsymbol y = proj_W\boldsymbol y\quad\text{for all }\boldsymbol y\text{ in }\mathbb R^n$$
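Both identities can be checked numerically (a sketch assuming NumPy; $U$'s orthonormal columns come from a QR factorization):

```python
import numpy as np

rng = np.random.default_rng(3)
n, p = 5, 2
U, _ = np.linalg.qr(rng.standard_normal((n, p)))   # n x p orthonormal columns

x = rng.standard_normal(p)
y = rng.standard_normal(n)

print(np.allclose(U.T @ U @ x, x))       # U^T U = I_p, so U^T U x = x

# UU^T y agrees with the term-by-term projection formula of Theorem 10.
proj = sum((y @ U[:, j]) * U[:, j] for j in range(p))
print(np.allclose(U @ U.T @ y, proj))    # UU^T y = proj_W y
```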
EXAMPLE
Let $W$ be a subspace of $\mathbb R^n$. Let $\boldsymbol x$ and $\boldsymbol y$ be vectors in $\mathbb R^n$ and let $\boldsymbol z = \boldsymbol x + \boldsymbol y$. If $\boldsymbol u$ is the projection of $\boldsymbol x$ onto $W$ and $\boldsymbol v$ is the projection of $\boldsymbol y$ onto $W$, show that $\boldsymbol u + \boldsymbol v$ is the projection of $\boldsymbol z$ onto $W$.
SOLUTION
- Let $U$ be a matrix whose columns consist of an orthonormal basis for $W$. Then
$$\begin{aligned}proj_W\boldsymbol z &= UU^T\boldsymbol z \\&= UU^T(\boldsymbol x + \boldsymbol y)\\&= UU^T\boldsymbol x + UU^T\boldsymbol y \\&= proj_W\boldsymbol x + proj_W\boldsymbol y \\&= \boldsymbol u + \boldsymbol v\end{aligned}$$
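The same linearity can be verified numerically (a sketch assuming NumPy; $U$, $\boldsymbol x$, and $\boldsymbol y$ are arbitrary):

```python
import numpy as np

rng = np.random.default_rng(4)
U, _ = np.linalg.qr(rng.standard_normal((4, 2)))   # orthonormal basis for W

x = rng.standard_normal(4)
y = rng.standard_normal(4)
z = x + y

u = U @ U.T @ x    # projection of x onto W
v = U @ U.T @ y    # projection of y onto W

print(np.allclose(U @ U.T @ z, u + v))   # True: proj_W z = u + v
```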