Algorithms {Matrix DP (matrix-optimized recurrence DP)}

omipus

Last modified 2024-05-29 16:01:17


First published 2023-02-11 13:55:35

Article link: https://blog.csdn.net/qq_66485519/article/details/128983109


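Matrix DP

Definition

Suppose there are several DPs (for example three of them: DP0[ N], DP1[ N], DP2[ N]) and the recurrence of each one is a linear combination of the previous values, i.e. each of $DP0[i], DP1[i], DP2[i]$ has the form $k_1 * DP0[i-1] + k_2 * DP1[i-1] + k_3 * DP2[i-1]$ (each with its own constants $k_1, k_2, k_3$);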

for( int i = 1; i < N; ++i){
	DP0[ i] = k0*DP0[ i-1] + k1*DP1[ i-1] + k2*DP2[ i-1];
	DP1[ i] = k3*DP0[ i-1] + k4*DP1[ i-1] + k5*DP2[ i-1];
	DP2[ i] = k6*DP0[ i-1] + k7*DP1[ i-1] + k8*DP2[ i-1];
}

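This takes $O(N)$ time; if $N$ is very large, say $> 10^8$, the process above can be turned into matrix multiplication;

DP[3] = { DP0[0], DP1[0], DP2[0]}; (initial values)
A[3][3] = { {k0, k3, k6}, {k1, k4, k7}, {k2, k5, k8}};

Notice that the matrix product `DP * A = DPP` gives DPP[0]=DP0[1] DPP[1]=DP1[1] DPP[2]=DP2[1];
In other words, `* A` means: one step of the DP recurrence has been performed (i.e. one iteration of the naive for loop);
So to get DP?[ N-1] of the naive algorithm, that is, to perform `N-1` steps, compute `DP * A*...*A` (with N-1 copies of A);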

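By the associativity of *matrix multiplication*, performing for example 5 steps is equivalent to `(DP*A)*(A*A*A*A)`;
Therefore `DP * A^n` can be computed with fast exponentiation, in either of the following two ways:  
1: first compute `AA = A^n`, then evaluate `DP*AA` at the end;
2: let DP take part in the fast exponentiation directly, i.e. `DP*(A^1) * (A^2) * (A^4) * ...`, multiplying in only those powers whose bit is set in the binary representation of `n`;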
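
As a small worked instance of the second approach (the exponent 13 is chosen only for illustration), the binary digits of the exponent decide which squared powers of `A` get multiplied into the row vector:

```latex
13 = 1101_2 = 8 + 4 + 1
\quad\Longrightarrow\quad
DP \cdot A^{13} = \bigl(\bigl(DP \cdot A^{1}\bigr) \cdot A^{4}\bigr) \cdot A^{8}
```

The power $A^{2}$ is still computed on the way to $A^{4}$, but it is skipped in the product because its bit in 13 is 0.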

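Mistakes

For example, when computing A^k by fast exponentiation, initializing the answer matrix to all ones is wrong!
The multiplicative identity of matrix multiplication is the identity matrix, which is not all ones: it has 1 on the main diagonal and 0 everywhere else;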
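
A minimal sketch of the first approach (compute `A^k` on its own) that shows the correct initialization. The names `Matrix`, `multiply` and `matrix_power` and the modulus parameter are illustrative assumptions, not part of the original article; entries are assumed to be already reduced modulo `mod`, and `mod` is assumed to be below 2^32 so that products fit in 64 bits.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper type for this sketch: a square matrix of residues.
using Matrix = std::vector<std::vector<std::uint64_t>>;

// Product of two n*n matrices, every entry reduced modulo `mod`.
// Assumes all entries are < mod and mod < 2^32 (no 64-bit overflow).
Matrix multiply(const Matrix& x, const Matrix& y, std::uint64_t mod) {
    const std::size_t n = x.size();
    Matrix z(n, std::vector<std::uint64_t>(n, 0));
    for (std::size_t r = 0; r < n; ++r)
        for (std::size_t k = 0; k < n; ++k)
            for (std::size_t c = 0; c < n; ++c)
                z[r][c] = (z[r][c] + x[r][k] * y[k][c]) % mod;
    return z;
}

// Computes a^k modulo `mod` by binary exponentiation.
Matrix matrix_power(Matrix a, long long k, std::uint64_t mod) {
    assert(k >= 0);
    const std::size_t n = a.size();
    // Start from the identity matrix (ones on the main diagonal only),
    // NOT from an all-ones matrix.
    Matrix result(n, std::vector<std::uint64_t>(n, 0));
    for (std::size_t i = 0; i < n; ++i) result[i][i] = 1 % mod;
    while (k > 0) {
        if (k & 1) result = multiply(result, a, mod);  // this bit of k is set
        a = multiply(a, a, mod);                       // a^(2^t) -> a^(2^(t+1))
        k >>= 1;
    }
    return result;
}
```

With `k = 0` the loop body never runs and the identity matrix is returned, which is exactly what an all-ones start would get wrong.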

Properties

Algorithm

Code

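//{ ___DP_MatrixMultiply (matrix-optimized recurrence DP)
#include <cassert>
#include <utility>
#include <vector>
template< class _T_> std::vector< _T_> ___DP_MatrixMultiply( std::vector<_T_> _initDP, std::vector<std::vector<_T_> > _factor, long long _count){
//< e.g. `initDP = [DP0, DP1], factor: [[a,b], [c,d]]` means the DP recurrence is `DP0 = a*DP0 + b*DP1, DP1 = c*DP0 + d*DP1`;
//  . `_count` is the number of recurrence steps, i.e. the *return value* is `initDP` after `count` steps;
//  . no modulus is applied here; if the values can overflow, `_T_` has to be a type that handles it (e.g. a modular-integer type);
    assert( _count >= 0);
    assert( !_factor.empty() && _initDP.size()==_factor.size() && _factor.size()==_factor[0].size());
    int const N = _initDP.size();
    { // convert `factor` into its transpose, so that the row vector can be multiplied from the left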
        for( int i = 0; i < N; ++i){
            for( int j = i+1; j < N; ++j){
                std::swap( _factor[i][j], _factor[j][i]);
            }
        }
    }
    auto temp1 = _initDP;
    auto temp2 = _factor;
    while( _count > 0){
        if( _count & 1){ // DP *= factor;
            for( int col = 0; col < N; ++col){
                temp1[ col] = 0;
                for( int i = 0; i < N; ++i){
                    temp1[ col] += (_initDP[ i] * _factor[ i][ col]);
                }
            }
            _initDP = temp1;
        }
        _count >>= 1;
        { // factor *= factor
            for( int row = 0; row < N; ++row){
                for( int col = 0; col < N; ++col){
                    temp2[ row][ col] = 0;
                    for( int i = 0; i < N; ++i){
                        temp2[ row][ col] += (_factor[ row][ i] * _factor[ i][ col]);
                    }
                }
            }
            _factor = temp2;
        }
    }
    return _initDP;
}
//} ___DP_MatrixMultiply

Example problems

@LINK: https://editor.csdn.net/md/?not_checkout=1&articleId=139273191;
Find the value of the decimal number 4.....4 (of length 1e9) under a modulus;
Put a constant into the DP matrix;
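
A hedged sketch of how this could look: appending one digit is the step `x' = 10*x + 4`, so the row vector is taken to be `[x, 4]` with the constant 4 carried along unchanged. The function name `repeated_fours_mod`, the parameter names and the restriction `mod < 2^32` are assumptions made for this illustration; the linked problem may state its modulus differently.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

using Row = std::array<std::uint64_t, 2>;
using Mat = std::array<Row, 2>;

// Value of the decimal number made of `length` copies of the digit 4, modulo `mod`.
// Assumes 1 <= mod < 2^32 so that products of residues fit in 64 bits.
std::uint64_t repeated_fours_mod(long long length, std::uint64_t mod) {
    assert(length >= 0 && mod >= 1 && mod < (1ULL << 32));
    // Row vector [x, c]: x is the number built so far, c is the constant 4.
    Row dp = {0, 4 % mod};
    // One step: x' = 10*x + 1*c and c' = 0*x + 1*c.
    // Column j of the matrix holds the coefficients of output j.
    Mat f = {{{10 % mod, 0}, {1 % mod, 1 % mod}}};
    while (length > 0) {
        if (length & 1) {  // dp = dp * f
            Row next = {0, 0};
            for (int j = 0; j < 2; ++j)
                for (int i = 0; i < 2; ++i)
                    next[j] = (next[j] + dp[i] * f[i][j]) % mod;
            dp = next;
        }
        Mat sq = {};  // f = f * f
        for (int r = 0; r < 2; ++r)
            for (int c = 0; c < 2; ++c)
                for (int k = 0; k < 2; ++k)
                    sq[r][c] = (sq[r][c] + f[r][k] * f[k][c]) % mod;
        f = sq;
        length >>= 1;
    }
    return dp[0];
}
```

A call such as `repeated_fours_mod(1000000000, m)` needs only about 30 loop iterations, since the loop runs once per binary digit of the length.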

@DELI;

@LINK: https://editor.csdn.net/md/?not_checkout=1&articleId=132818193;

@DELI;

AcWing-1305. GT考试 (GT Exam)

AcWing-1304. 佳佳的斐波那契 (Jiajia's Fibonacci)

AcWing-1303. 斐波那契前 n 项和 (Sum of the first n Fibonacci numbers)

Notes

The essence of Matrix-Multiplication in algorithms is $DP$, so it is worth reviewing some notions of $DP$:
0 Time (Order): the DP-Value at Time- $i$ ( $DP[i]$ ) always depends on $DP[<i]$;
. . For example, $Fibonacci[i] = Fibonacci[i-1] + Fibonacci[i-2]$;
1 State: at every Time- $i$, the $DP$ has several States;
. . e.g., $DP[n]$ has just one state ( $DP[i]$ ); $DP[n][m]$ has $m$ states $DP[i][0, 1, ..., m-1]$;

@Delimiter

The Matrix-Multiplication in algorithms must be in the form:
$Dp[a_0, a_1, ..., a_{n-1}] * Factor\begin{bmatrix} k_{0,0} & k_{0,1} & ... & k_{0,n-1} \\ ... \\ k_{n-1,0} & k_{n-1,1} & ... & k_{n-1,n-1} \end{bmatrix} = Result[b_0, b_1, ..., b_{n-1}]$

$Dp$ must be a $1 * n$ Row-Matrix;
$Factor$ must be an $n * n$ Square-Matrix;
Then, $Result$ is a $1 * n$ Row-Matrix;
We call $Dp, Result$ the $\text{DP-Matrix}$;

If we take $Dp[a_0, a_1, ..., a_{n-1}]$ to be the DP-Time- $i$ (i.e., $DP[i]$ ), then $Result[b_0, b_1, ..., b_{n-1}]$ is the DP-Time- $i+1$ (i.e., $DP[i+1]$ );
. More specifically, the phrase $Dp[a_0, a_1, ..., a_{n-1}] = DP[i]$ means that $DP[i]$ has $n$ States $a_0, ..., a_{n-1}$;
. So $a_j, b_j$ denote the same DP-State; the only difference is that the Time of $a_j$ is $i$ and the Time of $b_j$ is $i+1$;

Then, $b_j = C_0 * a_0 + C_1 * a_1 + ... + C_{n-1} * a_{n-1}$ where the $C_t$ are all Constants (reading off the product above, $C_t = k_{t,j}$ );
. That is, every $b_j$ is a Linear-Combination of all the $a_t$;
. Because the $C_t$ must be Constants, the DP-State-Transformation must be a Linear-Combination (i.e., a sum of $\text{Constant * DP-State}$ terms); if it is not, that DP does not fit Matrix-Multiplication;
. . e.g., $DP[i] = DP[i-1] * DP[i-2]$ is infeasible;

The most important point is to make clear the correspondence between the Time of the matrix $Dp[...]$ and the Time of the DP stored in each of its elements;
. e.g., calculate $S[i] = \sum_{j = 1}^{i} F[j]$ where $F[]$ is the Fibonacci sequence;
. . Suppose the DP-Time of the matrix $Dp[...]$ is $i$, and we let $a_0 = F[i], a_1 = F[i+1], a_2 = S[i]$ (suppose these three values are already known);
. . We need to update every $b_c$ correctly, that is, to determine the entries $k_{r,c}$ of the matrix $Factor$;
. . The matrix $Result$ is then at Time $i+1$ (because the matrix $Dp$ is at Time $i$ ); therefore $b_0 = F[i+1]$, $b_1 = F[i+2]$, $b_2 = S[i+1]$;
. . We need to find the relation between the $a_r$ and the $b_c$: $b_0 = a_1$, $b_1 = a_0 + a_1$, $b_2 = a_2 + a_1$; this is valid because every formula is a Linear-Combination;
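
The relations $b_0 = a_1$, $b_1 = a_0 + a_1$, $b_2 = a_1 + a_2$ can be turned into a $3 * 3$ Factor by writing the coefficients of each $b_c$ down column $c$. The sketch below assumes the indexing $F[0] = 0, F[1] = 1$ and a modulus below 2^32; the function name `fibonacci_prefix_sum` and the type aliases are illustrative choices, not taken from the article.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

using Vec3 = std::array<std::uint64_t, 3>;
using Mat3 = std::array<Vec3, 3>;

// S[n] = F[1] + ... + F[n] modulo `mod`, assuming F[0] = 0 and F[1] = 1.
// Assumes 1 <= mod < 2^32 so that products of residues fit in 64 bits.
std::uint64_t fibonacci_prefix_sum(long long n, std::uint64_t mod) {
    assert(n >= 0 && mod >= 1 && mod < (1ULL << 32));
    // Time-0 row vector [a0, a1, a2] = [F[0], F[1], S[0]].
    Vec3 dp = {0, 1 % mod, 0};
    // Column c lists the coefficients of b_c:
    //   b0 = a1,  b1 = a0 + a1,  b2 = a1 + a2.
    Mat3 factor = {{{0, 1, 0}, {1, 1, 1}, {0, 0, 1}}};
    while (n > 0) {
        if (n & 1) {  // dp = dp * factor
            Vec3 next = {0, 0, 0};
            for (int c = 0; c < 3; ++c)
                for (int r = 0; r < 3; ++r)
                    next[c] = (next[c] + dp[r] * factor[r][c]) % mod;
            dp = next;
        }
        Mat3 squared = {};  // factor = factor * factor
        for (int r = 0; r < 3; ++r)
            for (int c = 0; c < 3; ++c)
                for (int k = 0; k < 3; ++k)
                    squared[r][c] = (squared[r][c] + factor[r][k] * factor[k][c]) % mod;
        factor = squared;
        n >>= 1;
    }
    return dp[2];  // after n steps the vector is [F[n], F[n+1], S[n]]
}
```

Storing $F[i+1]$ next to $F[i]$ is what makes the update of $S$ linear: $S[i+1]$ needs $F[i+1]$, which is already an element of the Time- $i$ vector.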

@Delimiter

Suppose we initially set the matrix $Dp[...]$ to DP-Time- $0$ and want to advance it to DP-Time- $i$;
. That is, compute $Dp[...] * Factor * Factor * ... * Factor$ with $i$ copies of $Factor$; the result is the DP-Time- $i$;
. Matrix-Multiplication is associative, so this can be rewritten as $Dp[...] * (Factor ^ i)$, which we can evaluate with Binary-Exponentiation;

@DELI;

$\text{Property-0}$

The elements $a_i$ in the matrix $Dp[a_0, a_1, ...]$ can be divided into two types:
0 $a_i$ is a DP-State; the value of $a_i$ depends on the DP-Time of the matrix $Dp[...]$ (so $a_i$ is not a Constant);
1 $a_i$ is a Constant; whatever the DP-Time of the matrix $Dp[...]$ is, $a_i$ is always the same;

For example, given $dp[i] = 2 * dp[i-1] + 5$, we rewrite it as $2 * dp[i-1] + 1 * 5$, which satisfies the requirement that the coefficients $2, 1$ are Constants;
. Let the matrix $Dp[a, b]$ be the DP-Time- $i$ where $a = dp[i], b = 5$;
. Then the matrix $Result[c, d]$ is the DP-Time- $i+1$ where $c = dp[i+1] = 2 * dp[i] + 5, d = 5$, so $c = 2 * a + 1 * b, d = 0 * a + 1 * b$, which are both Linear-Combinations;
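
Written out as a matrix product (row vector on the left, as everywhere in these notes), the example corresponds to the following $2 * 2$ Factor; this is just the two relations for $c$ and $d$ placed into columns:

```latex
\begin{bmatrix} a & b \end{bmatrix}
\begin{bmatrix} 2 & 0 \\ 1 & 1 \end{bmatrix}
=
\begin{bmatrix} 2a + b & b \end{bmatrix}
=
\begin{bmatrix} c & d \end{bmatrix}
```

For instance, assuming a starting value $dp[0] = 1$ purely for illustration, the vector evolves as $[1, 5] \to [7, 5] \to [19, 5]$, while the constant element stays at 5.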

@Delimiter

$\text{Property-1}$

Once the DP-Time of the matrix $Dp[...]$ is settled (suppose it is $i$ ), the DP-Time of each of its DP-Elements is also fixed;
. e.g., let the DP-Time of the matrix $Dp[a, b, c, d]$ be $i$, where $a = dp_1[i], b = dp_2[i-1], c = dp_3[i+1]$ and $d$ is a Constant-Element;
. . Now, given another matrix $T[a', b', c', d']$ of the same layout whose DP-Time is $j$, we know that $a' = dp_1[j], b' = dp_2[j-1], c' = dp_3[j+1]$ and $d' = d$;