Eigen学习教程(三)

名字好难想啊啊啊啊

已于 2022-10-12 22:49:07 修改

阅读量3.6k

点赞数 3

分类专栏： Eigen学习

于 2022-04-20 09:47:07 首次发布

原文链接：https://github.com/qixianyu-buaa/EigenChineseDocument

版权

Eigen学习专栏收录该内容

4 篇文章 9 订阅

订阅专栏

Eigen学习教程(三)

3. 稠密线性问题与分解

本节说明了如何求解线性系统，计算各种分解，例如LU，QR，SVD，本征分解…阅读此节后，请不要错过我们的密集矩阵分解目录。
基本线性求解 Ax=b。
该解决方案：可将各种分解之间进行选择，取决于你的矩阵一个样子，取决于你是否赞成速度或准确性。但是，让我们从一个适用于所有情况的示例开始，这是一个很好的折衷方案。

3.1 线性代数与分解

3.1.1基本线性求解

        Matrix3f A;
        Vector3f b;
        A << 1, 2, 3, 4, 5, 6, 7, 8, 10;
        b << 3, 3, 4;
        cout << "Here is the matrix A:\n"
             << A << endl;
        cout << "Here is the vector b:\n"
             << b << endl;
        Vector3f x = A.colPivHouseholderQr().solve(b);
        cout << "The solution is:\n"
             << x << endl;

        // Output is:
        // Here is the matrix A:
        //  1  2  3
        //  4  5  6
        //  7  8 10
        // Here is the vector b:
        // 3
        // 3
        // 4
        // The solution is:
        // -2
        //  1
        //  1

在此示例中，colPivHouseholderQr()方法返回类ColPivHouseholderQR的对象。由于此处的矩阵类型为Matrix3f，因此该行可能已替换为:

ColPivHouseholderQR <Matrix3f> dec(A);
Vector3f x = dec.solve(b);

在这里，ColPivHouseholderQR是具有选择列主元功能的QR分解。这是本教程的一个不错的折衷方案，因为它适用于所有矩阵，而且速度非常快。这是一些其他分解表，您可以根据矩阵和要进行的权衡选择:

        Decomposition	Method	          Requirements on the matrix	 Speed(small-to-medium)	                Speed (large)	                Accuracy
        PartialPivLU	                    partialPivLu()	                     Invertible	                                                                        ++	                                                ++	                                        +
        FullPivLU	                    fullPivLu()	                         None	                                                                            -	                                                     - -	                                   +++
        HouseholderQR	householderQr()	                                        None	                                                                        ++	                                                ++                                              +
        ColPivHouseholderQR	          colPivHouseholderQr()	               None	                                                                +	                                                     -	                                           +++
        FullPivHouseholderQR	          fullPivHouseholderQr()	               None                                                            	-	                                                    - -	                                                +++
        CompleteOrthogonalDecomposition	completeOrthogonalDecomposition()	None	+	-	+++
        LLT	llt()	                                                                Positive definite	                                                                +++	+++	+
        LDLT	ldlt()	                                        Positive or negative semidefinite	                                                +++	+	++
        BDCSVD	bdcSvd()	None	-	-	+++
        JacobiSVD	jacobiSvd()	None	-	- - -	+++

所有这些分解都提供了一个solve()方法，该方法与上述示例一样。例如，如果您的矩阵是正定的，则上表说明LLT或LDLT分解是一个很好的选择。这是一个示例，也说明使用通用矩阵（而非矢量）作为右手边是可能的。

       Matrix2f A, b;
       A << 2, -1, -1, 3;
       b << 1, 2, 3, 1;
       cout << "Here is the matrix A:\n"
            << A << endl;
       cout << "Here is the right hand side b:\n"
            << b << endl;
       Matrix2f x = A.ldlt().solve(b);
       cout << "The solution is:\n"
            << x << endl;
       //Output is:
       // Here is the matrix A:
       //  2 -1
       // -1  3
       // Here is the right hand side b:
       // 1 2
       // 3 1
       // The solution is:
       // 1.2 1.4
       // 1.4 0.8

3.1.2 检查是否有解

计算相对误差的方法, 只有您知道要允许解决方案被视为有效的误差范围。因此，Eigen允许您自己进行此计算，如以下示例所示:

        MatrixXd A = MatrixXd::Random(100, 100);
        MatrixXd b = MatrixXd::Random(100, 50);
        MatrixXd x = A.fullPivLu().solve(b);
        double relative_error = (A * x - b).norm() / b.norm(); // norm() is L2 norm
        cout << "The relative error is:\n"
             << relative_error << endl;
        // Output is:
        // The relative error is:
        // 2.31495e-14

3.1.3 计算特征值和特征向量

您需要在此处进行特征分解，确保检查矩阵是否是自伴随的，在数学里，作用于一个有限维的内积空间，一个自伴算子(self-adjoint operator)等于自己的伴随算子；等价地说，表达自伴算子的矩阵是埃尔米特矩阵。埃尔米特矩阵等于自己的共轭转置。根据有限维的谱定理，必定存在着一个正交归一基，可以表达自伴算子为一个实值的对角矩阵。就像在这些问题中经常发生的那样。这是一个使用SelfAdjointEigenSolver的示例，可以使用EigenSolver或ComplexEigenSolver轻松地将其应用于一般矩阵。特征值和特征向量的计算不一定会收敛，但是这种收敛失败的情况很少。调用info()就是为了检查这种可能性。

        Matrix2f A;
        A << 1, 2, 2, 3;
        cout << "Here is the matrix A:\n"
             << A << endl;
        SelfAdjointEigenSolver<Matrix2f> eigensolver(A);
        if (eigensolver.info() != Success)
                abort();
        cout << "The eigenvalues of A are:\n"
             << eigensolver.eigenvalues() << endl;
        cout << "Here's a matrix whose columns are eigenvectors of A \n"
             << "corresponding to these eigenvalues:\n"
             << eigensolver.eigenvectors() << endl;
        //Output is:
        // Here is the matrix A:
        // 1 2
        // 2 3
        // The eigenvalues of A are:
        // -0.236
        //  4.24
        // Here's a matrix whose columns are eigenvectors of A
        // corresponding to these eigenvalues:
        // -0.851 -0.526
        //  0.526 -0.851

3.1.4 计算逆与行列式

首先，请确保您确实想要这个。尽管逆和行列式是基本的数学概念，但在数值线性代数中，它们不如在纯数学中流行。逆计算通常可以用solve()操作代替，而行列式通常不是检查矩阵是否可逆的好方法。但是，对于非常小的矩阵，上述条件是不正确的，并且逆和行列式可能非常有用。尽管某些分解（例如PartialPivLU和FullPivLU）提供了inverse()和determinant()方法，但您也可以直接在矩阵上调用inverse()和determinant()。如果矩阵的固定大小很小(最多为4x4)，这将使Eigen避免执行LU分解，而应使用对此类小矩阵更有效的公式。

        Matrix3f A;
        A << 1, 2, 1,
             2, 1, 0,
            -1, 1, 2;
        cout << "Here is the matrix A:\n"
             << A << endl;
        cout << "The determinant of A is " << A.determinant() << endl;
        cout << "The inverse of A is:\n"
             << A.inverse() << endl;

        // Output is:
        // Here is the matrix A:
        //  1  2  1
        //  2  1  0
        // -1  1  2
        // The determinant of A is -3
        // The inverse of A is:
        // -0.667      1  0.333
        //   1.33     -1 -0.667
        //     -1      1      1

3.1.5 最小二乘法求解

最小二乘求解的最准确方法是SVD分解。Eigen提供了两种实现。推荐的对象是BDCSVD类，它可以很好地解决较大的问题，并自动退回到JacobiSVD类以解决较小的问题。对于这两个类，它们的resolve()方法都在进行最小二乘求解。

        MatrixXf A = MatrixXf::Random(3, 2);
        cout << "Here is the matrix A:\n"
             << A << endl;
        VectorXf b = VectorXf::Random(3);
        cout << "Here is the right hand side b:\n"
             << b << endl;
        cout << "The least-squares solution is:\n"
             << A.bdcSvd(ComputeThinU | ComputeThinV).solve(b) << endl;

        // Output is:
        // Here is the matrix A:
        //   0.68  0.597
        // -0.211  0.823
        //  0.566 -0.605
        // Here is the right hand side b:
        //  -0.33
        //  0.536
        // -0.444
        // The least-squares solution is:
        // -0.67
        // 0.314

可能更快但可靠性较低的另一种方法是使用矩阵的Cholesky分解或QR分解。我们关于最小二乘法求解的页面有更多详细信息。

3.1.6 将计算与构造分离

在以上示例中，在构造分解对象的同时计算了分解。但是，在某些情况下，您可能希望将这两件事分开，例如，如果在构造时不知道要分解的矩阵，则可能会需要将它们分开。或者您想重用现有的分解对象。使之成为可能的原因是: 所有分解都有默认的构造函数，所有分解都具有执行计算的compute(matrix)方法，并且可以在已计算的分解中再次调用该方法，以将其重新初始化。

        Matrix2f A, b;
        LLT<Matrix2f> llt;
        A << 2, -1, -1, 3;
        b << 1, 2, 3, 1;
        cout << "Here is the matrix A:\n"
             << A << endl;
        cout << "Here is the right hand side b:\n"
             << b << endl;
        cout << "Computing LLT decomposition..." << endl;
        llt.compute(A);
        cout << "The solution is:\n"
             << llt.solve(b) << endl;
        A(1, 1) ++;
        cout << "The matrix A is now:\n"
             << A << endl;
        cout << "Computing LLT decomposition..." << endl;
        llt.compute(A);
        cout << "The solution is now:\n"
             << llt.solve(b) << endl;

最后，您可以告诉分解构造函数预先分配存储空间以分解给定大小的矩阵，以便在随后分解此类矩阵时，不执行动态内存分配(当然，如果您使用的是固定大小的矩阵，则不存在动态内存分配完全发生)。只需将大小传递给分解构造函数即可完成，如以下示例所示:

       HouseholderQR<MatrixXf> qr(50, 50);
       MatrixXf A = MatrixXf ::Random(50, 50);
       qr.compute(A); //没有动态内存分配

3.1.7 矩阵秩

某些分解是揭示矩阵秩的。这些通常也是在非满秩矩阵（在方形情况下表示奇异矩阵）的情况下表现最佳的分解。秩揭示分解至少提供了rank()方法。它们还可以提供方便的方法，例如isInvertible()，并且还提供一些方法来计算矩阵的核(零空间)和像(列空间)，就像FullPivLU那样:

        Matrix3f A;
        A << 1, 2, 5,
             2, 1, 4,
             3, 0, 3;
        cout << "Here is the matrix A:\n"
             << A << endl;
        FullPivLU<Matrix3f> lu_decomp(A);
        cout << "The rank of A is " << lu_decomp.rank() << endl;
        cout << "Here is a matrix whose columns form a basis of the null-space of A:\n"
             << lu_decomp.kernel() << endl;
        cout << "Here is a matrix whose columns form a basis of the column-space of A:\n"
             << lu_decomp.image(A) << endl; // yes, have to pass the original A

        // Output is:
        // Here is the matrix A:
        // 1 2 5
        // 2 1 4
        // 3 0 3
        // The rank of A is 2
        // Here is a matrix whose columns form a basis of the null-space of A:
        //  0.5
        //    1
        // -0.5
        // Here is a matrix whose columns form a basis of the column-space of A:
        // 5 1
        // 4 2
        // 3 3

当然，任何秩计算都取决于对任意阈值的选择，因为实际上没有浮点矩阵恰好是秩不足的。Eigen选择一个明智的默认阈值，该阈值取决于分解，但通常是对角线大小乘以机器ε。虽然这是我们可以选择的最佳默认值，但只有您知道您的应用程序的正确阈值是多少。您可以通过在调用rank()或需要使用此阈值的任何其他方法之前在分解对象上调用setThreshold()来进行设置。分解本身(即compute()方法)与阈值无关。更改阈值后，无需重新计算分解。

        Matrix2d A;
        A << 2, 1,
             2, 0.9999999999;
        FullPivLU<Matrix2d> lu(A);
        cout << "By default, the rank of A is found to be " << lu.rank() << endl;
        lu.setThreshold(1e-5);
        cout << "With threshold 1e-5, the rank of A is found to be " << lu.rank() << endl;

        // Output is:
        // By default, the rank of A is found to be 2
        // With threshold 1e-5, the rank of A is found to be 1

3.2 求解线性最小二乘问题

本页介绍如何使用本征求解线性最小二乘系统。一个超定方程组，例如Ax = b，没有解。在这种情况下，在差异Ax - b尽可能小的意义上，搜索最接近解的向量x是有意义的。该x称为最小二乘解(如果使用欧几里得范数)。本页讨论的三种方法是SVD分解，QR分解和正态方程。其中，SVD分解通常最准确但最慢，正则方程(normal equations)最快但最不准确，QR分解介于两者之间。

3.2.1 使用ESVD分解

BDCSVD类中的solve()中方法可以直接用来求解线性最小二乘系统。仅计算奇异值（此类的默认值）是不够的。您还需要奇异矢量，但是稀疏SVD分解足以计算最小二乘解:

        MatrixXf A = MatrixXf::Random(3, 2);
        cout << "Here is the matrix A:\n"
             << A << endl;
        VectorXf b = VectorXf::Random(3);
        cout << "Here is the right hand side b:\n"
             << b << endl;
        cout << "The least-squares solution is:\n"
             << A.bdcSvd(ComputeThinU | ComputeThinV).solve(b) << endl;
        // Output is:
        // Here is the matrix A:
        //   0.68  0.597
        // -0.211  0.823
        //  0.566 -0.605
        // Here is the right hand side b:
        //  -0.33
        //  0.536
        // -0.444
        // The least-squares solution is:
        // -0.67
        // 0.314

3.2.2 使用QR分解

QR分解类中的resolve()方法还计算最小二乘解。有3种QR分解类：HouseholderQR(无pivoting，因此快速但不稳定)，ColPivHouseholderQR(列枢轴，因此较慢但更准确)和FullPivHouseholderQR(全枢轴，因此最慢且最稳定)。

        MatrixXf A = MatrixXf::Random(3, 2);
        VectorXf b = VectorXf::Random(3);
        cout << "The solution using the QR decomposition is:\n"
             << A.colPivHouseholderQr().solve(b) << endl;
        // Output is:
        // The solution using the QR decomposition is:
        // -0.67
        // 0.314

3.2.3 使用范数方程

找到Ax = b的最小二乘解等效于求解法线方程 $A^T Ax = A^T b$ 。如果矩阵A是病态的，那么这不是一个好方法，因为 $A^T A$ 的条件数是 $A$ 的条件数的平方。这意味着与使用其他方法相比，使用正则方程式丢失的数字要多两倍。

        MatrixXf A = MatrixXf::Random(3, 2);
        VectorXf b = VectorXf::Random(3);
        cout << "The solution using normal equations is:\n"
             << (A.transpose() * A).ldlt().solve(A.transpose() * b) << endl;

3.4 就地分解

从Eigen 3.3开始，LU，Cholesky和QR分解可以就地进行操作，即直接在给定的输入矩阵内进行。当处理大量矩阵或可用内存非常有限（嵌入式系统）时，此功能特别有用。为此，必须使用Ref <>矩阵类型实例化各个分解类，并且必须使用输入矩阵作为参数来构造分解对象。作为示例，让我们考虑partial pivoting的LU分解。

        // 声明一个2x2矩阵 A:

        MatrixXd A(2, 2);
        A << 2, -1, 1, 3;
        cout << "Here is the input matrix A before decomposition:\n"
             << A << endl;
        // Output is:
        // Here is the input matrix A before decomposition:
        //  2 -1
        //  1  3

毫不奇怪！然后，声明我们的Inplace LU对象lu，并检查矩阵的内容A: 这相当于把A和lu绑定了，所以后文中即使计算A1，A1的内容也不会改变。

       PartialPivLU<Ref<MatrixXd>> lu(A);
       cout << "Here is the input matrix A after decomposition:\n"
            << A << endl;

       // Output is:
       // Here is the input matrix A after decomposition:
       //   2  -1
       // 0.5 3.5

在这里，lu对象计算并将L和U因数存储在矩阵所保存的内存中A。因此，系数A在分解过程中已被破坏，并由L和U因子代替，可以验证:

        // 也就是分解的结果保存在矩阵中
        cout << "Here is the matrix storing the L and U factors:\n"
             << lu.matrixLU() << endl;
        // Output is:
        // Here is the matrix storing the L and U factors:
        //   2  -1
        // 0.5 3.5
        // 然后，该lu对象可以像往常一样使用，例如解决Ax = b问题：
        MatrixXd A0(2, 2);
        A0 << 2, -1, 1, 3;
        VectorXd b(2);
        b << 1, 2;
        VectorXd x = lu.solve(b);
        cout << "Residual: " << (A0 * x - b).norm() << endl;
        // Output is:
        // Residual: 0

在这里，由于原始矩阵的内容A已丢失，因此我们必须声明一个新矩阵A0以验证结果。由于在A和lu之间共享内存，因此修改矩阵A将导致lu无效。可以通过修改的内容A并尝试再次解决初始问题来轻松验证这一点:

        A << 3, 4, -2, 1;
        x = lu.solve(b);
        // Output is:
        // Residual: 15.8114

请注意，这没有共享指针，所以保持输入矩阵A和lu具有共同的生命周期是用户的责任。如果要使用修改后的A更新因式分解，则必须像往常一样调用compute方法:

        A0 = A; // save A
        lu.compute(A);
        x = lu.solve(b);
        cout << "Residual: " << (A0 * x - b).norm() << endl;
        // Output is:
        // Residual: 0

请注意，调用compute不会更改该lu对象引用的内存。因此，如果使用A1不同于的另一个矩阵调用计算方法A，则A1不会修改内容。这仍然A是将用于存储矩阵的L和U因子的内容A1。可以很容易地验证如下:

        MatrixXd A1(2, 2);
        A1 << 5, -2, 3, 4;
        lu.compute(A1);
        cout << "Here is the input matrix A1 after decomposition:\n"
             << A1 << endl;
        // Output is:
        // Here is the input matrix A1 after decomposition:
        //  5 -2
        //  3  4

矩阵A1是不变的，因此可以求解A1 *x = b，直接检查残差而无需任何副本A1 :

        x = lu.solve(b);
        cout << "Residual: " << (A1 * x - b).norm() << endl;
        // Output is:
        //Residual: 2.48253e-16

这是支持此就位机制的矩阵分解列表:

        // class LLT
        // class LDLT
        // class PartialPivLU
        // class FullPivLU
        // class HouseholderQR
        // class ColPivHouseholderQR
        // class FullPivHouseholderQR
        // class CompleteOrthogonalDecomposition

名字好难想啊啊啊啊

关注

3
点赞
踩
21

收藏

觉得还不错? 一键收藏
0
评论
Eigen学习教程(三)

Eigen学习教程(三)3. 稠密线性问题与分解本节说明了如何求解线性系统，计算各种分解，例如LU，QR，SVD，本征分解…阅读此节后，请不要错过我们的密集矩阵分解目录。基本线性求解 Ax=b。该解决方案：可将各种分解之间进行选择，取决于你的矩阵一个样子，取决于你是否赞成速度或准确性。但是，让我们从一个适用于所有情况的示例开始，这是一个很好的折衷方案。3.1 基本线性求解 Matrix3f A; Vector3f b; A << 1, 2
复制链接

扫一扫