t r ( A ⋅ ( B ⊙ C ) ) = t r ( ( A ⊙ B T ) ⋅ C ) tr\left(A\cdot\left(B\odot C\right)\right)=tr\left(\left(A\odot B^{T}\right)\cdot C\right) tr(A⋅(B⊙C))=tr((A⊙BT)⋅C)
t r ( A ⋅ ( B ∗ v a l i d C ) ) = t r ( ( A ∗ f u l l B r o t T ) ⋅ C ) = t r ( ( A ∗ v a l i d C r o t T ) ⋅ B ) tr\left(A\cdot\left(B*_{valid} C\right)\right)=tr\left(\left(A*_{full} B_{rot}^{T}\right)\cdot C\right)=tr\left(\left(A*_{valid} C_{rot}^{T}\right)\cdot B\right) tr(A⋅(B∗validC))=tr((A∗fullBrotT)⋅C)=tr((A∗validCrotT)⋅B) 其中 : A ∈ R m − n + 1 × m − n + 1 , B ∈ R n × n , C ∈ R m × m , 且 m ≥ n ≥ 2 其中:A\in\mathbb{R}^{m-n+1\times m-n+1},B\in\mathbb{R}^{n\times n},C\in\mathbb{R}^{m\times m},且m\ge n\ge 2 其中:A∈Rm−n+1×m−n+1,B∈Rn×n,C∈Rm×m,且m≥n≥2
t r ( A ⋅ ( B ∗ f u l l C ) ) = t r ( ( A ∗ v a l i d B r o t T ) ⋅ C ) = t r ( ( A ∗ v a l i d C r o t T ) ⋅ B ) tr\left(A\cdot\left(B*_{full} C\right)\right)=tr\left(\left(A*_{valid} B_{rot}^{T}\right)\cdot C\right)=tr\left(\left(A*_{valid} C_{rot}^{T}\right)\cdot B\right) tr(A⋅(B∗fullC))=tr((A∗validBrotT)⋅C)=tr((A∗validCrotT)⋅B) 其中 : A ∈ R m + n − 1 × m + n − 1 , B ∈ R n × n , C ∈ R m × m , 且 m ≥ 2 , n ≥ 2 其中:A\in\mathbb{R}^{m+n-1\times m+n-1},B\in\mathbb{R}^{n\times n},C\in\mathbb{R}^{m\times m},且m\ge 2, n\ge 2 其中:A∈Rm+n−1×m+n−1,B∈Rn×n,C∈Rm×m,且m≥2,n≥2
A r o t T = ( A r o t ) T = ( A T ) r o t A_{rot}^{T}=\left(A_{rot}\right)^{T}=\left(A^{T}\right)_{rot} ArotT=(Arot)T=(AT)rot
d ( A ∗ { f u l l O R v a l i d } B ) = d A ∗ { f u l l O R v a l i d } B + A ∗ { f u l l O R v a l i d } d B d(A*_{\{full\ OR\ valid\}}B)=dA*_{\{full\ OR\ valid\}}B+A*_{\{full\ OR\ valid\}}dB d(A∗{full OR valid}B)=dA∗{full OR valid}B+A∗{full OR valid}dB
d ( A ⋅ B ) = d A ⋅ B + A ⋅ d B d(A\cdot B)=dA\cdot B+A\cdot dB d(A⋅B)=dA⋅B+A⋅dB
d ( A ⊙ B ) = d A ⊙ B + A ⊙ d B d(A\odot B)=dA\odot B+A\odot dB d(A⊙B)=dA⊙B+A⊙dB
d f = t r ( ∂ f ∂ X T ⋅ d X ) df=tr\left(\frac{\partial f}{\partial X}^{T}\cdot dX\right) df=tr(∂X∂fT⋅dX)
d t r ( X ) = t r ( d X ) dtr(X)=tr(dX) dtr(X)=tr(dX)
设 A , B , C 为四维数组 , 设A,B,C为四维数组, 设A,B,C为四维数组, A ∈ R m × n × h × h , B ∈ R m × p × l × l , C ∈ R p × n × k × k , A\in\mathbb{R}^{m\times n\times h\times h},B\in\mathbb{R}^{m\times p\times l\times l},C\in\mathbb{R}^{p\times n\times k\times k}, A∈Rm×n×h×h,B∈Rm×p×l×l,C∈Rp×n×k×k, A = [ α 11 α 12 … α 1 n α 21 α 22 … α 2 n ⋮ ⋮ ⋱ ⋮ α m 1 α m 2 … α m n ] , A=\begin{bmatrix} \alpha_{11}& \alpha_{12}& \dots& \alpha_{1n}\\ \alpha_{21}& \alpha_{22}& \dots& \alpha_{2n}\\ \vdots& \vdots& \ddots & \vdots\\ \alpha_{m1}& \alpha_{m2}& \dots& \alpha_{mn} \end{bmatrix}, A= α11α21⋮αm1α12α22⋮αm2……⋱…α1nα2n⋮αmn , α i j ∈ R h × h ( i = 1 , 2 , … , m , j = 1 , 2 , … , n ) , \alpha_{ij}\in\mathbb{R}^{h\times h}\left (i=1,2,\dots,m,j=1,2,\dots,n\right ), αij∈Rh×h(i=1,2,…,m,j=1,2,…,n), 及 : α i j = [ a 11 a 12 … a 1 h a 21 a 22 … a 2 h ⋮ ⋮ ⋱ ⋮ a h 1 a h 2 … a h h ] , 及:\alpha_{ij}=\begin{bmatrix} a_{11}& a_{12}& \dots& a_{1h}\\ a_{21}& a_{22}& \dots& a_{2h}\\ \vdots& \vdots& \ddots & \vdots\\ a_{h1}& a_{h2}& \dots& a_{hh} \end{bmatrix}, 及:αij= a11a21⋮ah1a12a22⋮ah2……⋱…a1ha2h⋮ahh , B = [ β 11 β 12 … β 1 p β 21 β 22 … β 2 p ⋮ ⋮ ⋱ ⋮ β m 1 β m 2 … β m p ] , B=\begin{bmatrix} \beta_{11}& \beta_{12}& \dots& \beta_{1p}\\ \beta_{21}& \beta_{22}& \dots& \beta_{2p}\\ \vdots& \vdots& \ddots & \vdots\\ \beta_{m1}& \beta_{m2}& \dots& \beta_{mp} \end{bmatrix}, B= β11β21⋮βm1β12β22⋮βm2……⋱…β1pβ2p⋮βmp , β i j ∈ R l × l ( i = 1 , 2 , … , m , j = 1 , 2 , … , p ) , \beta_{ij}\in\mathbb{R}^{l\times l}\left (i=1,2,\dots,m,j=1,2,\dots,p\right ), βij∈Rl×l(i=1,2,…,m,j=1,2,…,p), C = [ γ 11 γ 12 … γ 1 n γ 21 γ 22 … γ 2 n ⋮ ⋮ ⋱ ⋮ γ p 1 γ p 2 … γ p n ] , C=\begin{bmatrix} \gamma_{11}& \gamma_{12}& \dots& \gamma_{1n}\\ \gamma_{21}& \gamma_{22}& \dots& \gamma_{2n}\\ \vdots& \vdots& \ddots & \vdots\\ \gamma_{p1}& \gamma_{p2}& \dots& \gamma_{pn} \end{bmatrix}, C= γ11γ21⋮γp1γ12γ22⋮γp2……⋱…γ1nγ2n⋮γpn , γ i j ∈ R k × k ( i = 1 , 2 , … , p , j = 1 , 2 , … , n ) . \gamma_{ij}\in\mathbb{R}^{k\times k}\left (i=1,2,\dots,p,j=1,2,\dots,n\right ). γij∈Rk×k(i=1,2,…,p,j=1,2,…,n). A 、 B 、 C 可视为元素类型为二维数组的二维数组 . A、B、C可视为元素类型为二维数组的二维数组. A、B、C可视为元素类型为二维数组的二维数组. 定义四维数组卷积运算 A = B ∗ C ( ∗ 包括 ∗ v a l i d 与 ∗ f u l l ) 为 : 定义四维数组卷积运算A=B*C(*包括*_{valid} 与*_{full})为: 定义四维数组卷积运算A=B∗C(∗包括∗valid与∗full)为: [ β 11 β 12 … β 1 p β 21 β 22 … β 2 p ⋮ ⋮ ⋱ ⋮ β m 1 β m 2 … β m p ] ∗ [ γ 11 γ 12 … γ p n γ 21 γ 22 … γ p n ⋮ ⋮ ⋱ ⋮ γ p 1 γ p 2 … γ p n ] = \begin{bmatrix} \beta_{11}& \beta_{12}& \dots& \beta_{1p}\\ \beta_{21}& \beta_{22}& \dots& \beta_{2p}\\ \vdots& \vdots& \ddots & \vdots\\ \beta_{m1}& \beta_{m2}& \dots& \beta_{mp} \end{bmatrix} * \begin{bmatrix} \gamma_{11}& \gamma_{12}& \dots& \gamma_{pn}\\ \gamma_{21}& \gamma_{22}& \dots& \gamma_{pn}\\ \vdots& \vdots& \ddots & \vdots\\ \gamma_{p1}& \gamma_{p2}& \dots& \gamma_{pn} \end{bmatrix} = β11β21⋮βm1β12β22⋮βm2……⋱…β1pβ2p⋮βmp ∗ γ11γ21⋮γp1γ12γ22⋮γp2……⋱…γpnγpn⋮γpn = [ ∑ i = 1 p β 1 i ∗ γ i 1 ∑ i = 1 p β 1 i ∗ γ i 2 … ∑ i = 1 p β 1 i ∗ γ i n ∑ i = 1 p β 2 i ∗ γ i 1 ∑ i = 1 p β 2 i ∗ γ i 2 … ∑ i = 1 p β 2 i ∗ γ i n ⋮ ⋮ ⋱ ⋮ ∑ i = 1 p β m i ∗ γ i 1 ∑ i = 1 p β m i ∗ γ i 2 … ∑ i = 1 p β m i ∗ γ i n ] \begin{bmatrix} \sum_{i=1}^{p}\beta_{1i}*\gamma_{i1} & \sum_{i=1}^{p}\beta_{1i}*\gamma_{i2}& \dots & \sum_{i=1}^{p}\beta_{1i}*\gamma_{in}\\ \sum_{i=1}^{p}\beta_{2i}*\gamma_{i1} & \sum_{i=1}^{p}\beta_{2i}*\gamma_{i2}& \dots & \sum_{i=1}^{p}\beta_{2i}*\gamma_{in}\\ \vdots& \vdots& \ddots& \vdots\\ \sum_{i=1}^{p}\beta_{mi}*\gamma_{i1} & \sum_{i=1}^{p}\beta_{mi}*\gamma_{i2}& \dots & \sum_{i=1}^{p}\beta_{mi}*\gamma_{in} \end{bmatrix} ∑i=1pβ1i∗γi1∑i=1pβ2i∗γi1⋮∑i=1pβmi∗γi1∑i=1pβ1i∗γi2∑i=1pβ2i∗γi2⋮∑i=1pβmi∗γi2……⋱…∑i=1pβ1i∗γin∑i=1pβ2i∗γin⋮∑i=1pβmi∗γin 例如 : [ a 11 a 12 a 21 a 22 ] ∗ [ b 11 b 12 b 21 b 22 ] = 例如:\begin{bmatrix} a_{11}& a_{12}\\ a_{21}& a_{22} \end{bmatrix}* \begin{bmatrix} b_{11}& b_{12}\\ b_{21}& b_{22} \end{bmatrix}= 例如:[a11a21a12a22]∗[b11b21b12b22]= [ a 11 ∗ b 11 + a 12 ∗ b 21 a 11 ∗ b 12 + a 12 ∗ b 22 a 21 ∗ b 11 + a 22 ∗ b 21 a 21 ∗ b 12 + a 22 ∗ b 22 ] \begin{bmatrix} a_{11}*b_{11}+a_{12}*b_{21}& a_{11}*b_{12}+a_{12}*b_{22}\\ a_{21}*b_{11}+a_{22}*b_{21}& a_{21}*b_{12}+a_{22}*b_{22} \end{bmatrix} [a11∗b11+a12∗b21a21∗b11+a22∗b21a11∗b12+a12∗b22a21∗b12+a22∗b22]
对于四维数组 对于四维数组 对于四维数组 A = [ α 11 α 12 … α 1 n α 21 α 22 … α 2 n ⋮ ⋮ ⋱ ⋮ α m 1 α m 2 … α m n ] , 定义 : A=\begin{bmatrix} \alpha_{11}& \alpha_{12}& \dots& \alpha_{1n}\\ \alpha_{21}& \alpha_{22}& \dots& \alpha_{2n}\\ \vdots& \vdots& \ddots & \vdots\\ \alpha_{m1}& \alpha_{m2}& \dots& \alpha_{mn} \end{bmatrix},定义: A= α11α21⋮αm1α12α22⋮αm2……⋱…α1nα2n⋮αmn ,定义: A r t = [ ( α 11 ) r o t T ( α 12 ) r o t T … ( α 1 n ) r o t T ( α 21 ) r o t T ( α 22 ) r o t T … ( α 2 n ) r o t T ⋮ ⋮ ⋱ ⋮ ( α m 1 ) r o t T ( α m 2 ) r o t T … ( α m n ) r o t T ] A_{r}^{t}=\begin{bmatrix} \left(\alpha_{11}\right)_{rot}^{T}& \left(\alpha_{12}\right)_{rot}^{T}& \dots& \left(\alpha_{1n}\right)_{rot}^{T}\\ \left(\alpha_{21}\right)_{rot}^{T}& \left(\alpha_{22}\right)_{rot}^{T}& \dots& \left(\alpha_{2n}\right)_{rot}^{T}\\ \vdots& \vdots& \ddots & \vdots\\ \left(\alpha_{m1}\right)_{rot}^{T}& \left(\alpha_{m2}\right)_{rot}^{T}& \dots& \left(\alpha_{mn}\right)_{rot}^{T} \end{bmatrix} Art= (α11)rotT(α21)rotT⋮(αm1)rotT(α12)rotT(α22)rotT⋮(αm2)rotT……⋱…(α1n)rotT(α2n)rotT⋮(αmn)rotT
对于四维数组 对于四维数组 对于四维数组 A = [ α 11 α 12 … α 1 n α 21 α 22 … α 2 n ⋮ ⋮ ⋱ ⋮ α m 1 α m 2 … α m n ] , 定义 : A=\begin{bmatrix} \alpha_{11}& \alpha_{12}& \dots& \alpha_{1n}\\ \alpha_{21}& \alpha_{22}& \dots& \alpha_{2n}\\ \vdots& \vdots& \ddots & \vdots\\ \alpha_{m1}& \alpha_{m2}& \dots& \alpha_{mn} \end{bmatrix},定义: A= α11α21⋮αm1α12α22⋮αm2……⋱…α1nα2n⋮αmn ,定义: A T = [ α 11 T α 21 T … α m 1 T α 12 T α 22 T … α m 2 T ⋮ ⋮ ⋱ ⋮ α 1 n T α 2 n T … α m n T ] A^{T}=\begin{bmatrix} \alpha_{11}^{T}& \alpha_{21}^{T}& \dots& \alpha_{m1}^{T}\\ \alpha_{12}^{T}& \alpha_{22}^{T}& \dots& \alpha_{m2}^{T}\\ \vdots& \vdots& \ddots & \vdots\\ \alpha_{1n}^{T}& \alpha_{2n}^{T}& \dots& \alpha_{mn}^{T} \end{bmatrix} AT= α11Tα12T⋮α1nTα21Tα22T⋮α2nT……⋱…αm1Tαm2T⋮αmnT
t r ( A ⋅ ( B ∗ v a l i d C ) ) = t r ( ( A ∗ f u l l B r t ) ⋅ C ) = t r ( ( C r t ∗ v a l i d A ) ⋅ B ) tr\left(A\cdot \left(B*_{valid}C\right)\right)=tr\left(\left(A*_{full} B_{r}^{t}\right)\cdot C\right)=tr\left(\left(C_{r}^{t}*_{valid}A\right)\cdot B\right) tr(A⋅(B∗validC))=tr((A∗fullBrt)⋅C)=tr((Crt∗validA)⋅B) A ∈ R n × m × k − l + 1 × k − l + 1 , B ∈ R m × p × l × l , C ∈ R p × n × k × k , 且 k ≥ l ≥ 2. A\in\mathbb{R}^{n\times m\times k-l+1\times k-l+1},B\in\mathbb{R}^{m\times p\times l\times l},C\in\mathbb{R}^{p\times n\times k\times k},且k\ge l\ge 2. A∈Rn×m×k−l+1×k−l+1,B∈Rm×p×l×l,C∈Rp×n×k×k,且k≥l≥2.
t r ( A ⋅ ( B ∗ f u l l C ) ) = t r ( ( A ∗ v a l i d B r t ) ⋅ C ) = t r ( ( C r t ∗ v a l i d A ) ⋅ B ) tr\left(A\cdot \left(B*_{full}C\right)\right)=tr\left(\left(A*_{valid} B_{r}^{t}\right)\cdot C\right)=tr\left(\left(C_{r}^{t}*_{valid}A\right)\cdot B\right) tr(A⋅(B∗fullC))=tr((A∗validBrt)⋅C)=tr((Crt∗validA)⋅B) A ∈ R n × m × k + l − 1 × k + l − 1 , B ∈ R m × p × l × l , C ∈ R p × n × k × k , 且 k ≥ 2 , l ≥ 2. A\in\mathbb{R}^{n\times m\times k+l-1\times k+l-1},B\in\mathbb{R}^{m\times p\times l\times l},C\in\mathbb{R}^{p\times n\times k\times k},且k\ge 2,l\ge 2. A∈Rn×m×k+l−1×k+l−1,B∈Rm×p×l×l,C∈Rp×n×k×k,且k≥2,l≥2.
A , B 为四维数组 , 则 : A,B为四维数组,则: A,B为四维数组,则: ( A ∗ B ) T = B T ∗ A T (A*B)^{T}=B^{T}*A^{T} (A∗B)T=BT∗AT