This post is a set of reading notes on *Linear Algebra and Its Applications*.
- If $A$ is an $m \times n$ matrix, each column of $A$ identifies a vector in $\mathbb R^m$:
$$A=[\ \boldsymbol a_1\ \ \boldsymbol a_2\ \ \dots\ \ \boldsymbol a_n\ ]$$
- The diagonal entries in an $m \times n$ matrix $A = [a_{ij}]$ are $a_{11}, a_{22}, a_{33},\dots$, and they form the main diagonal of $A$. A diagonal matrix is a square $n \times n$ matrix whose nondiagonal entries are zero. An $m \times n$ matrix whose entries are all zero is a zero matrix and is written as $0$. The size of a zero matrix is usually clear from the context.
- The arithmetic for vectors described earlier has a natural extension to matrices.
Sums and Scalar Multiples
Sum
- The sum $A + B$ is the $m \times n$ matrix whose columns are the sums of the corresponding columns in $A$ and $B$.
- The sum $A + B$ is defined only when $A$ and $B$ are the same size.
Scalar Multiple
- If $r$ is a scalar and $A$ is a matrix, then the scalar multiple $rA$ is the matrix whose columns are $r$ times the corresponding columns in $A$.
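As a quick numerical check of these two rules, here is a minimal numpy sketch (the matrices are made up for illustration):

```python
import numpy as np

# Example matrices, chosen only for illustration.
A = np.array([[4, 0, 5],
              [-1, 3, 2]])
B = np.array([[1, 1, 1],
              [3, 5, 7]])

# The sum A + B adds corresponding columns (equivalently, entries).
S = A + B

# The scalar multiple rA scales every column of A by r.
r = 2
rA = r * A
```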
Matrix Multiplication
Multiplication of matrices corresponds to composition of linear transformations
- When a matrix $B$ multiplies a vector $\boldsymbol x$, it transforms $\boldsymbol x$ into the vector $B\boldsymbol x$. If this vector is then multiplied in turn by a matrix $A$, the resulting vector is $A(B\boldsymbol x)$. See Figure 2.
Thus $A(B\boldsymbol x)$ is produced from $\boldsymbol x$ by a *composition* of mappings. Our goal is to represent this composite mapping as multiplication by a single matrix, denoted by $AB$, so that
$$A(B\boldsymbol x)=(AB)\boldsymbol x$$
- If $A$ is $m \times n$, $B$ is $n \times p$, and $\boldsymbol x$ is in $\mathbb R^p$, then
$$B\boldsymbol x=x_1\boldsymbol b_1+\dots+x_p\boldsymbol b_p$$
By the linearity of multiplication by $A$,
$$A(B\boldsymbol x)=A(x_1\boldsymbol b_1)+\dots+A(x_p\boldsymbol b_p)=x_1A\boldsymbol b_1+\dots+x_pA\boldsymbol b_p$$
- The vector $A(B\boldsymbol x)$ is a linear combination of the vectors $A\boldsymbol b_1,\dots,A\boldsymbol b_p$, using the entries in $\boldsymbol x$ as weights. In matrix notation, this linear combination is written as
$$A(B\boldsymbol x)=[\ A\boldsymbol b_1\ \ A\boldsymbol b_2\ \ \dots\ \ A\boldsymbol b_p\ ]\boldsymbol x$$
Thus multiplication by $[\ A\boldsymbol b_1\ \ A\boldsymbol b_2\ \ \dots\ \ A\boldsymbol b_p\ ]$ transforms $\boldsymbol x$ into $A(B\boldsymbol x)$. We have found the matrix we sought!
- Each column of $AB$ is a linear combination of the columns of $A$ using weights from the corresponding column of $B$.
- Obviously, the number of columns of $A$ must match the number of rows in $B$ in order for a linear combination such as $A\boldsymbol b_1$ to be defined. Also, the definition of $AB$ shows that $AB$ has the same number of rows as $A$ and the same number of columns as $B$.
The definition of $AB$ lends itself well to parallel processing on a computer: the columns of $B$ are assigned individually or in groups to different processors, which independently and hence simultaneously compute the corresponding columns of $AB$.
EXAMPLE 3
Compute $AB$, where $A=\begin{bmatrix}2&3\\1&-5\end{bmatrix}$ and $B=\begin{bmatrix}4&3&6\\1&-2&3\end{bmatrix}$.
SOLUTION
- By the column definition, $A\boldsymbol b_1=\begin{bmatrix}2&3\\1&-5\end{bmatrix}\begin{bmatrix}4\\1\end{bmatrix}=\begin{bmatrix}11\\-1\end{bmatrix}$, $A\boldsymbol b_2=\begin{bmatrix}2&3\\1&-5\end{bmatrix}\begin{bmatrix}3\\-2\end{bmatrix}=\begin{bmatrix}0\\13\end{bmatrix}$, and $A\boldsymbol b_3=\begin{bmatrix}2&3\\1&-5\end{bmatrix}\begin{bmatrix}6\\3\end{bmatrix}=\begin{bmatrix}21\\-9\end{bmatrix}$, so $AB=\begin{bmatrix}11&0&21\\-1&13&-9\end{bmatrix}$.
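The product can also be assembled one column at a time, exactly as the column definition prescribes; a numpy sketch:

```python
import numpy as np

A = np.array([[2, 3],
              [1, -5]])
B = np.array([[4, 3, 6],
              [1, -2, 3]])

# Column j of AB is A times column j of B.
AB_cols = [A @ B[:, j] for j in range(B.shape[1])]
AB = np.column_stack(AB_cols)
```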
- The definition of A B AB AB is important for theoretical work and applications, but the following rule provides a more efficient method for calculating the individual entries in A B AB AB when working small problems by hand.
- Let $row_i(A)$ denote the $i$th row of a matrix $A$. Then
$$row_i(AB)=row_i(A)\cdot B$$
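A numpy sketch of this row rule, reusing the matrices from EXAMPLE 3:

```python
import numpy as np

# A and B from EXAMPLE 3 above.
A = np.array([[2, 3],
              [1, -5]])
B = np.array([[4, 3, 6],
              [1, -2, 3]])

AB = A @ B
# Row i of AB equals (row i of A) times B.
rows = [A[i, :] @ B for i in range(A.shape[0])]
```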
Inner product
- View vectors in $\mathbb R^n$ as $n \times 1$ matrices. For $\boldsymbol u$ and $\boldsymbol v$ in $\mathbb R^n$, the matrix product $\boldsymbol u^T\boldsymbol v$ is a $1 \times 1$ matrix, called the scalar product, or inner product, of $\boldsymbol u$ and $\boldsymbol v$. It is usually written as a single real number without brackets.
- Inner products ($\boldsymbol u^T\boldsymbol v$ and $\boldsymbol v^T\boldsymbol u$) have the transpose symbol in the middle.
- $\boldsymbol u^T\boldsymbol v=\boldsymbol v^T\boldsymbol u$
Outer product
- The matrix product $\boldsymbol u\boldsymbol v^T$ is an $n \times n$ matrix, called the outer product of $\boldsymbol u$ and $\boldsymbol v$.
- Outer products ($\boldsymbol u\boldsymbol v^T$ and $\boldsymbol v\boldsymbol u^T$) have the transpose symbol on the outside.
- The outer products $\boldsymbol u\boldsymbol v^T$ and $\boldsymbol v\boldsymbol u^T$ are transposes of each other.
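A minimal numpy sketch of both products (the vectors are made up for illustration):

```python
import numpy as np

# Vectors in R^3, viewed as 3x1 matrices.
u = np.array([[1], [2], [3]])
v = np.array([[4], [0], [-1]])

inner = u.T @ v   # 1x1 matrix, usually written as the scalar it contains
outer = u @ v.T   # 3x3 matrix
```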
Properties of Matrix Multiplication
- Recall that $I_m$ represents the $m \times m$ identity matrix and $I_m\boldsymbol x=\boldsymbol x$ for all $\boldsymbol x$ in $\mathbb R^m$.
- Additional properties:
  - (1) The product of two upper triangular matrices of the same order is again upper triangular.
    - [Hint: this can be proved with partitioned matrices; see EXAMPLE 5 of "Partitioned matrices".]
  - (2) The product of two lower triangular matrices of the same order is again lower triangular.
    - [Hint: this can be proved with partitioned matrices; see EXAMPLE 5 of "Partitioned matrices".]
  - (3) The product of two diagonal matrices of the same order is again diagonal, and each new diagonal entry is the product of the two corresponding original diagonal entries.
  - (4) $A\boldsymbol e_i=\boldsymbol a_i$
PROOF
Property (a)
- Property (a) is the associative law, $A(BC)=(AB)C$. It follows from the fact that matrix multiplication corresponds to composition of linear transformations (which are functions), and it is known that the composition of functions is associative.
In discrete mathematics, the composition of functions $f\circ g$ is defined as the relational product $g*f$; one can show that the relational product is associative, so composition of functions is associative as well.
- Here is another proof of (a) that rests on the "column definition" of the product of two matrices. Let
$$C=[\ \boldsymbol c_1\ \ \boldsymbol c_2\ \ \dots\ \ \boldsymbol c_p\ ]\\BC=[\ B\boldsymbol c_1\ \ B\boldsymbol c_2\ \ \dots\ \ B\boldsymbol c_p\ ]\\A(BC)=[\ A(B\boldsymbol c_1)\ \ A(B\boldsymbol c_2)\ \ \dots\ \ A(B\boldsymbol c_p)\ ]$$
Recall that the definition of $AB$ makes $A(B\boldsymbol x)=(AB)\boldsymbol x$ for all $\boldsymbol x$, so
$$A(BC)=[\ (AB)\boldsymbol c_1\ \ \dots\ \ (AB)\boldsymbol c_p\ ]=(AB)C$$
WARNINGS:
- In general, $AB \neq BA$. If $AB = BA$, we say that $A$ and $B$ commute with one another.
- The cancellation laws do not hold for matrix multiplication. That is, if $AB = AC$, it is not true in general that $B = C$.
- If a product $AB$ is the zero matrix, you cannot conclude in general that either $A = 0$ or $B = 0$.
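All three warnings can be exhibited with $2 \times 2$ matrices; a numpy sketch (the matrices are chosen only for illustration):

```python
import numpy as np

A = np.array([[0, 1],
              [0, 0]])
B = np.array([[1, 0],
              [0, 0]])
C = np.array([[9, 9],
              [0, 0]])

AB = A @ B   # the zero matrix, even though A != 0 and B != 0
BA = B @ A   # nonzero, so A and B do not commute
AC = A @ C   # equals AB although B != C: no cancellation law
```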
Tip: When $B$ is square and $C$ has fewer columns than $A$ has rows, it is more efficient to compute $A(BC)$ than $(AB)C$.
*Checkpoint*:
Show that if $\boldsymbol y$ is a linear combination of the columns of $AB$, then $\boldsymbol y$ is a linear combination of the columns of $A$.
*Answer to Checkpoint*:
- If $\boldsymbol y$ is a linear combination of the columns of $AB$, then there is a vector $\boldsymbol x$ such that $\boldsymbol y=(AB)\boldsymbol x$. By the definition of matrix multiplication, $\boldsymbol y=A(B\boldsymbol x)$. This expresses $\boldsymbol y$ as a linear combination of the columns of $A$, using the entries of the vector $B\boldsymbol x$ as weights.
EXAMPLE 4
Let $x_1,\dots,x_n$ be fixed numbers. The matrix $V$ below, called a Vandermonde matrix, occurs in applications such as signal processing, error-correcting codes, and polynomial interpolation.
$$V=\begin{bmatrix}1&x_1&x_1^2&\dots&x_1^{n-1}\\1&x_2&x_2^2&\dots&x_2^{n-1}\\\vdots&\vdots&\vdots&&\vdots\\1&x_n&x_n^2&\dots&x_n^{n-1}\end{bmatrix}$$
Given $\boldsymbol y=(y_1,\dots,y_n)$ in $\mathbb R^n$, suppose $\boldsymbol c=(c_0,\dots,c_{n-1})$ in $\mathbb R^n$ satisfies $V\boldsymbol c=\boldsymbol y$, and define the polynomial
$$p(t)=c_0+c_1t+c_2t^2+\dots+c_{n-1}t^{n-1}$$
- a. Show that $p(x_1)=y_1,\dots,p(x_n)=y_n$. We call $p(t)$ an *interpolating polynomial for the points* $(x_1,y_1),\dots,(x_n,y_n)$ because the graph of $p(t)$ passes through the points.
- b. Suppose $x_1,\dots,x_n$ are distinct numbers. Show that the columns of $V$ are linearly independent.
- c. Prove: "If $x_1,\dots,x_n$ are distinct numbers, and $y_1,\dots,y_n$ are arbitrary numbers, then there is an interpolating polynomial of degree $\leq n-1$ for $(x_1,y_1),\dots,(x_n,y_n)$."
SOLUTION
- (b) [Hint: How many zeros can a polynomial of degree $n-1$ have?]
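A numpy sketch of part (c) in action: build $V$, solve $V\boldsymbol c=\boldsymbol y$, and check that $p$ passes through the points (the sample points are made up for illustration):

```python
import numpy as np

# Three sample points (x_i, y_i), interpolated by a degree <= 2 polynomial.
x = np.array([1.0, 2.0, 3.0])
y = np.array([2.0, 3.0, 6.0])
n = len(x)

# Vandermonde matrix: row i is (1, x_i, x_i^2, ..., x_i^(n-1)).
V = np.vander(x, n, increasing=True)

# Solve V c = y for the coefficients c = (c_0, ..., c_{n-1}).
c = np.linalg.solve(V, y)

# Evaluating p at each x_i (i.e., computing V c) reproduces y.
p = V @ c
```

For these points the solver returns $p(t)=3-2t+t^2$; distinctness of the $x_i$ is exactly what makes $V$ invertible.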
Powers of a Matrix
- If $A$ is an $n \times n$ matrix and $k$ is a positive integer, then $A^k$ denotes the product of $k$ copies of $A$:
$$A^k=\underbrace{A\cdots A}_{k\ \text{factors}}$$
- If $k = 0$, then $A^0\boldsymbol x$ should be $\boldsymbol x$ itself. Thus $A^0$ is interpreted as the identity matrix.
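A minimal numpy sketch, including the $A^0$ convention (the matrix is made up for illustration):

```python
import numpy as np

A = np.array([[1, 1],
              [0, 1]])

A3 = np.linalg.matrix_power(A, 3)   # A A A
A0 = np.linalg.matrix_power(A, 0)   # interpreted as the identity matrix
```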
The Transpose of a Matrix
- The generalization of Theorem 3(d) to products of more than two factors can be stated in words as follows: the transpose of a product of matrices equals the product of their transposes in the reverse order.