2.1 Vectors and Linear Equations
The central problem of linear algebra is to solve a system of linear equations, which means the unknowns are only multiplied by numbers: we never see $x$ times $y$.
- The column picture of $Ax = b$: a combination of the $n$ columns of $A$ produces the vector $b$.
- The row picture of $Ax = b$: $m$ equations from $m$ rows give $m$ planes meeting at $x$.
$$\left[ \begin{matrix} 1 & -2\\ 3 & 2 \end{matrix} \right]\left[\begin{matrix}x\\y\end{matrix}\right]=\left[\begin{matrix}1\\11\end{matrix}\right]$$
The column picture:
$$x\left[\begin{matrix}1\\3\end{matrix}\right] + y\left[\begin{matrix}-2\\2\end{matrix}\right]=\left[\begin{matrix}1\\11\end{matrix}\right]$$
The row picture:
$$x-2y = 1\\ 3x+2y=11$$
The column picture can be carried out in NumPy when building the matrix; `np.linalg.inv(A)` computes the inverse of the matrix $A$.
>>> import numpy as np
>>> A = np.array([[1,3],[-2,2]])  # the column picture: each row here is a column of the system
>>> b = np.array([1,11])
>>> b.dot(np.linalg.inv(A))  # since A holds the columns as rows, b.dot(inv(A)) equals inv(A.T).dot(b), the solution (x, y)
array([3., 1.])
>>> np.dot(b,np.linalg.inv(A))  # the same computation written with np.dot
array([3., 1.])
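A more direct route (not from the notes above, just the standard NumPy call) is `np.linalg.solve`, with the system's columns stored as actual columns:

```python
import numpy as np

# Coefficient matrix with the system's columns as columns
A = np.array([[1, -2],
              [3, 2]])
b = np.array([1, 11])

# np.linalg.solve factors A instead of forming the inverse explicitly
x = np.linalg.solve(A, b)
print(x)  # [3. 1.]: the combination 3*(1,3) + 1*(-2,2) = (1,11)
```

Solving via a factorization is both faster and numerically safer than multiplying by an explicit inverse.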
Three Equations in Three Unknowns: $Ax = b$
$$x+2y+3z=6\\ 2x+5y+2z=4\\ 6x-3y+z=2$$
The Row Picture: two intersecting planes meet in a line, and that line crosses the third plane at the solution point.
The Column Picture:
$$x\left[\begin{matrix} 1\\2\\6 \end{matrix}\right] + y\left[\begin{matrix} 2\\5\\-3\end{matrix}\right] + z\left[\begin{matrix} 3\\2\\1 \end{matrix}\right]=\left[\begin{matrix} 6\\4\\2 \end{matrix}\right]$$
The solution is:
$$\left[\begin{matrix} x\\y\\z \end{matrix}\right] = \left[\begin{matrix} 0\\0\\2 \end{matrix}\right]$$
>>> import numpy as np
>>> A = np.array([[1,2,6],[2,5,-3],[3,2,1]])  # columns of the system stored as rows
>>> b = np.array([6,4,2])
>>> b.dot(np.linalg.inv(A))  # 2.78e-17 is floating-point round-off: the answer is (0, 0, 2)
array([0.00000000e+00, 2.77555756e-17, 2.00000000e+00])
The matrix form merges the row picture and the column picture:
$$\left[\begin{matrix}1&2&3\\ 2&5&2\\ 6&-3&1 \end{matrix}\right]\left[\begin{matrix} x\\y\\z\end{matrix}\right]=\left[\begin{matrix} 6\\4\\2 \end{matrix}\right]$$
The identity matrix $\bm{I}$ satisfies $\bm{I}\bm{x} = \bm{x}$: whatever vector the identity matrix multiplies, that vector is unchanged.
$$\bm{I} = \left[\begin{matrix}1&0&0\\0&1&0\\0&0&1\end{matrix}\right]$$
Besides the identity matrix, the following matrices are all interesting.
- Exchange matrix:
$$\left[\begin{matrix} 0&1\\1&0\end{matrix}\right]\left[\begin{matrix} x\\y\end{matrix}\right]=x\left[\begin{matrix} 0\\1\end{matrix}\right] + y\left[\begin{matrix}1\\0 \end{matrix}\right]=\left[\begin{matrix}x\cdot 0 + y\cdot 1\\x\cdot 1+y\cdot 0 \end{matrix}\right]=\left[\begin{matrix}y\\x \end{matrix}\right]$$
- Rotate every vector by 90° (clockwise):
$$\left[\begin{matrix} 0&1\\-1&0 \end{matrix}\right]\left[\begin{matrix} x\\y \end{matrix}\right]=x\left[\begin{matrix}0\\-1\end{matrix}\right] + y\left[\begin{matrix} 1\\0 \end{matrix}\right]=\left[\begin{matrix} x\cdot 0+y\cdot 1\\x\cdot(-1) + y\cdot 0 \end{matrix}\right]=\left[\begin{matrix}y\\-x \end{matrix}\right]$$
- Reflect every vector across the line $y = -x$:
$$\left[\begin{matrix} 0&-1\\-1&0 \end{matrix}\right]\left[\begin{matrix} x\\y\end{matrix}\right]=x\left[\begin{matrix} 0\\-1 \end{matrix}\right] + y\left[\begin{matrix} -1\\0 \end{matrix}\right]=\left[\begin{matrix} x\cdot 0 + y\cdot(-1)\\x\cdot(-1) + y\cdot 0\end{matrix}\right]=\left[\begin{matrix} -y\\-x \end{matrix}\right]$$
- Rotate every vector through 45° (counterclockwise):
$$\left[\begin{matrix} \frac{\sqrt{2}}{2} & -\frac{\sqrt{2}}{2} \\\frac{\sqrt{2}}{2}&\frac{\sqrt{2}}{2} \end{matrix}\right]\left[\begin{matrix} x\\y \end{matrix}\right]=x\left[\begin{matrix} \frac{\sqrt{2}}{2}\\\frac{\sqrt{2}}{2} \end{matrix}\right] + y\left[\begin{matrix} -\frac{\sqrt{2}}{2}\\\frac{\sqrt{2}}{2} \end{matrix}\right]=\left[\begin{matrix} \frac{\sqrt{2}}{2}x -\frac{\sqrt{2}}{2}y \\ \frac{\sqrt{2}}{2}x+\frac{\sqrt{2}}{2}y \end{matrix}\right]$$
All these problems are solved by the column picture.
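These special matrices are easy to test numerically. A quick sketch (the test vector `v` is arbitrary):

```python
import numpy as np

v = np.array([1.0, 2.0])

exchange = np.array([[0.0, 1.0],
                     [1.0, 0.0]])
rot45 = np.array([[np.sqrt(2)/2, -np.sqrt(2)/2],
                  [np.sqrt(2)/2,  np.sqrt(2)/2]])

print(exchange @ v)   # swaps the components: [2. 1.]
print(rot45 @ v)      # v rotated 45 degrees counterclockwise

# A rotation preserves length
print(np.linalg.norm(rot45 @ v), np.linalg.norm(v))
```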
Row Picture: each equation in $Ax=b$ gives a line ($n = 2$), a plane ($n = 3$), or a "hyperplane" ($n > 3$). They intersect at the solution or solutions, if any.
2.2 The Idea of Elimination
The corner entry $a_{11}$ is the first "pivot" and the ratio $a_{21}/a_{11}$ is the first "multiplier."
Elimination breaks down if a zero appears in a pivot position. Exchanging two equations may save it.
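A minimal sketch of forward elimination with a rescuing row exchange. The helper `forward_eliminate` is hypothetical (not from the book) and only exchanges when a pivot is exactly zero:

```python
import numpy as np

def forward_eliminate(A, b):
    """Reduce [A | b] to upper-triangular form, exchanging rows on a zero pivot."""
    A = A.astype(float).copy()
    b = b.astype(float).copy()
    n = len(b)
    for k in range(n):
        if A[k, k] == 0:
            # Pivot breakdown: look below for the largest entry in this column
            swap = k + np.argmax(np.abs(A[k:, k]))
            A[[k, swap]] = A[[swap, k]]
            b[[k, swap]] = b[[swap, k]]
        for i in range(k + 1, n):
            m = A[i, k] / A[k, k]      # the multiplier a_ik / a_kk
            A[i, k:] -= m * A[k, k:]
            b[i] -= m * b[k]
    return A, b

# Pivot position (1,1) holds a zero, so rows are exchanged first
U, c = forward_eliminate(np.array([[0, 2], [3, 1]]), np.array([4, 5]))
print(U)  # [[3. 1.], [0. 2.]]
print(c)  # [5. 4.]
```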
2.4 Rules for Matrix Operations
$$AB = \left[\begin{matrix} a&b\\c&d \end{matrix}\right]\left[\begin{matrix} E&F\\G&H \end{matrix}\right]=\left[\begin{matrix} a\\c \end{matrix}\right]\left[\begin{matrix} E&F \end{matrix}\right]+\left[\begin{matrix} b\\d \end{matrix}\right]\left[\begin{matrix} G&H \end{matrix}\right]=\left[\begin{matrix} aE&aF\\cE&cF \end{matrix}\right] + \left[\begin{matrix} bG&bH\\dG&dH \end{matrix}\right] = \left[\begin{matrix} aE+bG&aF+bH\\cE+dG&cF+dH \end{matrix}\right]$$
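This column-times-row view says $AB$ is a sum of outer products, which is easy to check in NumPy (the matrices here are arbitrary examples):

```python
import numpy as np

A = np.array([[1, 2],
              [3, 4]])
B = np.array([[5, 6],
              [7, 8]])

# AB = (column k of A)(row k of B), summed over k
outer_sum = sum(np.outer(A[:, k], B[k, :]) for k in range(A.shape[1]))
print(outer_sum)
print(np.array_equal(outer_sum, A @ B))  # True
```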
2.5 Inverse Matrices
Calculating $A^{-1}$ by Gauss-Jordan Elimination
$$A = \left[\begin{matrix} 2&-1&0\\-1&2&-1\\0&-1&2 \end{matrix}\right]$$
$$[A\ \ I] = \left[\begin{matrix} 2&-1&0&1&0&0\\-1&2&-1&0&1&0\\0&-1&2&0&0&1 \end{matrix}\right]\xrightarrow{\text{elimination}}$$
$$\rightarrow \left[\begin{matrix} 1&0&0& \frac{3}{4}&\frac{1}{2}&\frac{1}{4} \\0&1&0& \frac{1}{2}&1&\frac{1}{2} \\0&0&1& \frac{1}{4}&\frac{1}{2}&\frac{3}{4} \end{matrix}\right] = [I\ \ A^{-1}]$$
>>> import numpy as np
>>> import sympy as sm
>>> I = np.identity(3, dtype = int)
>>> A = [[2,-1,0], [-1,2,-1],[0,-1,2]]
>>> aug_matrix = sm.Matrix(np.concatenate((A, I), axis=1))
>>> aug_matrix
Matrix([
[ 2, -1, 0, 1, 0, 0],
[-1, 2, -1, 0, 1, 0],
[ 0, -1, 2, 0, 0, 1]])
>>> # Run elimination on the augmented matrix [A, I] to reach the
>>> # reduced row echelon form
>>> R, rref_pivots = aug_matrix.rref()
>>> # Read off X = inverse(A) from the last n columns of R
>>> R[:,3:]
Matrix([
[3/4, 1/2, 1/4],
[1/2, 1, 1/2],
[1/4, 1/2, 3/4]])
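As a sanity check, `np.linalg.inv` gives the same inverse in floating point:

```python
import numpy as np

A = np.array([[2, -1, 0],
              [-1, 2, -1],
              [0, -1, 2]])
A_inv = np.linalg.inv(A)

# Matches the Gauss-Jordan result: (1/4) * [[3,2,1],[2,4,2],[1,2,3]]
expected = np.array([[3, 2, 1],
                     [2, 4, 2],
                     [1, 2, 3]]) / 4
print(np.allclose(A_inv, expected))  # True
```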
Diagonally dominant matrices are invertible. A matrix is diagonally dominant when each diagonal entry beats the sum of the other entries in its row:
$$|a_{ii}| > \sum_{j\neq i}|a_{ij}|$$
like the following matrix:
$$\left[\begin{matrix} 3&1&1\\1&3&1\\1&1&3 \end{matrix}\right]$$
Some matrices that are not diagonally dominant are also invertible, like the following matrix:
$$\left[\begin{matrix} 2&1&1\\1&2&1\\1&1&3 \end{matrix}\right]$$
The inverse of a triangular difference matrix $A$ is a triangular sum matrix $S$.
$$[A\ \ I]=\left[\begin{matrix} 1&0&0&1&0&0\\-1&1&0&0&1&0\\0&-1&1&0&0&1 \end{matrix}\right] \rightarrow\left[\begin{matrix} 1&0&0&1&0&0\\0&1&0&1&1&0\\0&0&1&1&1&1 \end{matrix}\right]=[I\ \ A^{-1}]$$
$A$ is the difference matrix; $A^{-1}$ is the sum matrix.
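The difference/sum pair mirrors running differences and cumulative sums, a quick numeric sketch:

```python
import numpy as np

A = np.array([[1, 0, 0],
              [-1, 1, 0],
              [0, -1, 1]])   # difference matrix
S = np.linalg.inv(A)          # sum matrix: lower triangle of ones

x = np.array([3.0, 5.0, 9.0])
print(A @ x)        # differences: [3. 2. 4.]
print(S @ (A @ x))  # running sums undo the differences: back to [3. 5. 9.]
print(S)            # [[1,0,0],[1,1,0],[1,1,1]] up to round-off
```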
2.6 Elimination = Factorization
For the symmetric Pascal matrix, the factorization $A = LU$ can be produced with a Cholesky routine, since here $U = L^T$:
>>> from scipy.linalg import pascal, cholesky
>>> # create the 4x4 Pascal matrix
>>> A = pascal(4)
>>> # Cholesky factorization: A = L @ U with U = L.T
>>> L = cholesky(A, lower=True)
>>> U = cholesky(A, lower=False)
>>> # check the factorization A = LU
>>> L.dot(U) == A
array([[ True, True, True, True],
[ True, True, True, True],
[ True, True, True, True],
[ True, True, True, True]])
>>> A
array([[ 1, 1, 1, 1],
[ 1, 2, 3, 4],
[ 1, 3, 6, 10],
[ 1, 4, 10, 20]], dtype=uint64)
>>> L
array([[1., 0., 0., 0.],
[1., 1., 0., 0.],
[1., 2., 1., 0.],
[1., 3., 3., 1.]])
>>> U
array([[1., 1., 1., 1.],
[0., 1., 2., 3.],
[0., 0., 1., 3.],
[0., 0., 0., 1.]])
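Elimination itself produces the same L and U: the multipliers that knock out each column go into L. A sketch without row exchanges, so it assumes every pivot is nonzero:

```python
import numpy as np
from scipy.linalg import pascal

def lu_no_exchange(A):
    """LU by elimination, assuming no zero pivot appears."""
    U = A.astype(float).copy()
    n = U.shape[0]
    L = np.eye(n)
    for k in range(n):
        for i in range(k + 1, n):
            L[i, k] = U[i, k] / U[k, k]     # store the multiplier in L
            U[i, k:] -= L[i, k] * U[k, k:]  # subtract multiplier * pivot row
    return L, U

A = pascal(4).astype(float)
L, U = lu_no_exchange(A)
print(np.allclose(L @ U, A))  # True
```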
2.7 Transpose and Permutations
$$(AB)^T = B^TA^T$$
$$(A^{-1})^T = (A^T)^{-1}$$
A symmetric matrix has $S^T = S$. This means that $S_{ji} = S_{ij}$.
For a permutation matrix $P$, $P^{-1} = P^T$.
How do we find the permutation matrix $P$ in the formula $PA = LU$? Look down the column for the largest pivot.
>>> from scipy.linalg import lu
>>> import numpy as np
>>> A = np.array([[0,1,1], [1,2,1], [2,7,9]])
>>> P, L, U = lu(A)  # scipy returns P with A = P @ L @ U, so the book's PA = LU holds with P.T
>>> A
array([[0, 1, 1],
[1, 2, 1],
[2, 7, 9]])
>>> P.dot(L.dot(U))
array([[0., 1., 1.],
[1., 2., 1.],
[2., 7., 9.]])
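The permutation matrix returned by `lu` also illustrates $P^{-1} = P^T$:

```python
import numpy as np
from scipy.linalg import lu

A = np.array([[0, 1, 1], [1, 2, 1], [2, 7, 9]])
P, L, U = lu(A)

# For any permutation matrix, the transpose is the inverse
print(np.allclose(P.T @ P, np.eye(3)))          # True
print(np.allclose(np.linalg.inv(P), P.T))       # True
```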
And with that, I have finally chewed through the most content-packed chapter in the book!!!