IMU模型下的IEKF

最新推荐文章于 2024-10-01 10:37:56 发布

梦沐星海

最新推荐文章于 2024-10-01 10:37:56 发布

阅读量677

点赞数 2

文章标签：线性代数矩阵机器学习

本文链接：https://blog.csdn.net/weixin_53187505/article/details/123563138

版权

复杂系统下的不变扩展卡尔曼滤波

由前文可知，不变性要靠合理的状态误差设计来实现，但在更复杂的系统中，为所有状态量找到一个满足不变性的状态误差是很困难的，甚至一些状态是不可能被表示为群运算的。在存在这类状态量的系统中，有没有可能保留 $I E K F$ 的主要优点呢（主要指一致性方面的优点），答案是肯定的。

考虑这样一类系统，其系统方程如下：

$\begin{aligned} \dot\Theta_t &= g(\Theta_t) \\ \dot\chi_t &= f(\chi_t, u_t, \Theta_t) \\ y_t &= h(\chi_t, \Theta_t) \end{aligned}$

抛开矢量 $\Theta_{t}$ 不看，上式就是之前讨论的普通系统方程，引入矢量 $\Theta_{t}$ 后，系统的变化不仅依赖当前时刻的状态 $\chi_{t}$ 和输入 $u_{t}$ ，还依赖另外一个随着时间变化的量 $\Theta_{t}$ 。在普通的 $E K F$ 中，可以将 $\Theta_{t}$ 也融入到需要估计的状态量 $\chi_{t}$ 中。这里之所以分开写，是为了在 $I E K F$ 中区分可以被描述为群元素的状态量 $\chi_{t}$ ，和不能被描述为群元素的量 $\Theta_{t}$ 。

以一个完整的 $I M U$ 系统为例，系统方程为：

$\begin{aligned} \dot{R}_t&=R_t[\tilde{w}_t-b_t^w-n_t^w]_\times \\ \dot{v}_t&=R_t(\tilde{a}_t-b_t^a-n_t^a)+g \\ \dot{p}_t&=v_t \\ \dot{b}_t^w&=n_t^{bw} \\ \dot{b}_t^a &=n_t^{ba} \end{aligned}$

$其中b^w_t,b^a_t分别为IMU的gyro bias和accelerator bias，他们满足Random Walk模型，即随时间的变化率是一个满足高斯分布的白噪声。$

由前文可知， $R_t,v_t,p_t$ 能够构成一个群元素并且使得最终导出的误差系统方程满足不变性（系统矩阵与状态无关），但如果将两个 $b i a s$ 也放入群中，就很难找到一个状态误差使其满足不变性了。也就是说， $R_t,v_t,p_t$ 就是上文中的 $\chi_{t}$ ，而 $b^w_t,b^a_t$ 就是上文中的 $\Theta_{t}$ 。

为了处理这类系统，可以将状态误差定义为如下形式：

$e_t=(\hat{\chi}_t\chi_t^{-1},\ \hat\Theta_t-\Theta_t)=(\eta_t,\ \zeta_t)$

即右不变误差与普通线性误差的笛卡尔积，是一种混合误差。

混合误差的系统方程

现在需要推导在混合误差下，该误差的系统方程是怎样的。直接以上文的 $I M U$ 系统为例，令

$\chi_t= \begin{pmatrix} R_t & v_t & p_t \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix},\ \ \eta_t=\hat{\chi}_t\chi_t^{-1}$

以及

$\Theta_{t}=\begin{pmatrix} b_t^w \\ b_t^a \end{pmatrix},\ \ \zeta_t= \begin{pmatrix} \hat{b}_t^w - b_t^w \\ \hat{b}_t^a - b_t^a \end{pmatrix}= \begin{pmatrix} \zeta_t^w \\ \zeta_t^a \end{pmatrix}$

推导方式模仿前文中普通系统的推导，也就是计算：

$\dot{e}_{t}=\begin{pmatrix} \dot{\eta}_t \\ \\ \dot{\zeta}_t \end{pmatrix}$

其中 $\dot{\eta}_{t}$ 的定义和前文一样

$\begin{aligned} \dot{\eta}_t\approx \begin{pmatrix} [\dot{\xi}_{R_t}]_\times & \dot{\xi}_{v_t} & \dot{\xi}_{p_t} \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix} = \frac{d}{dt}\Lambda \begin{pmatrix} \xi_{R_t} \\ \xi_{v_t} \\ \xi_{p_t} \end{pmatrix} \end{aligned}$

仅在 $\xi_{R_t}, \xi_{v_t}, \xi_{p_t}$ 的计算中有少许不同，分别为

$\begin{aligned} \\ [\dot{\xi}_{R_t}]_\times\approx\dot{\eta}_{R_t}&=\dot{\hat{R}}_tR_t^T+\hat{R}_t\dot{R}_t^T \\ &=\hat{R}_t[\tilde{w}_t-\hat{b}_t^w]_\times R_t^T-\hat{R}_t[w_t]_\times R_t^T \\ &=\hat{R}_t[\tilde{w}_t-\hat{b}_t^w-w_t]_\times R_t^T \ \ \ \ (\text{其中}\ \tilde{w}_t=w_t+b_t^w+n_t^w) \\ &=\hat{R}_t[n_t^w-\zeta_t^w]_\times R_t^T \\ &=\hat{R}_t[n_t^w-\zeta_t^w]_\times \hat{R}_t^T \hat{R}_tR_t^T \\ &=[\hat{R}_t (n_t^w-\zeta_t^w)]_\times \eta_{R_t} \\ &\approx [\hat{R}_t (n_t^w-\zeta_t^w)]_\times ( I+[\xi_{R_t}]_\times) \\ &\approx [\hat{R}_t (n_t^w-\zeta_t^w)]_\times \end{aligned}$

以及

$\begin{aligned} \dot{\xi}_{v_t} &= \dot{\hat{v}}_t - \dot{\eta}_{R_t}v_t-\eta_{R_t}\dot{v}_t \\ &=\hat{R}_t(\tilde{a}_t-\hat{b}_t^a)+g-[\hat{R}_t (n_t^w-\zeta_t^w)]_\times v_t-\hat{R}_tR_t^T(R_ta_t+g) \\ &=\hat{R}_t(\tilde{a}_t-\hat{b}_t^a-a_t) + g-[\hat{R}_t (n_t^w-\zeta_t^w)]_\times v_t-\hat{R}_tR_t^Tg \ \ \ \ (\text{其中}\ \tilde{a}_t=a_t+b_t^a+n_t^a)\\ &\approx \hat{R}_t(n_t^a-\zeta_t^a)+g-[\hat{R}_t (n_t^w-\zeta_t^w)]_\times v_t-(I+[\xi_{R_t}]_\times)g \\ &=[g]_\times \xi_{R_t}+([v_t]_\times \hat{R}_t) (n_t^w-\zeta_t^w)+\hat{R}_t(n_t^a-\zeta_t^a) \end{aligned}$

以及

$\begin{aligned} \dot{\xi}_{p_t} &= \dot{\hat{p}}_t - \dot{\eta}_{R_t}p_t-\eta_{R_t}\dot{p}_t \\ &=\hat{v}_t-[\hat{R}_t (n_t^w-\zeta_t^w)]_\times p_t - \hat{R}_tR_t^Tv_t \\ &=(\hat{v}_t-\hat{R}_tR_t^Tv_t)-[\hat{R}_t (n_t^w-\zeta_t^w)]_\times p_t \\ &=\xi_{v_t}+([p_t]_\times \hat{R}_t) (n_t^w-\zeta_t^w) \end{aligned}$

而 $\zeta_{t}$ 由于是普通的线性误差，推导就比较简单了，即

$\dot{\zeta}_t= \begin{pmatrix} \dot{\zeta}_t^w \\ \dot{\zeta}_t^a \end{pmatrix} = \begin{pmatrix} -n_t^{bw} \\ -n_t^{ba} \end{pmatrix}$

综上，模仿前文的处理方式，可以得到整个系统状态误差的系统方程：

$\frac{d}{dt} \begin{pmatrix} \xi_{R_t} \\ \xi_{v_t} \\ \xi_{p_t} \\ \zeta_t^w \\ \zeta_t^a \end{pmatrix} = \begin{pmatrix} 0 & 0 & 0 & -\hat{R}_t & 0\\ [g]_\times & 0 & 0 & -[\hat{v}_t]\hat{R}_t & -\hat{R}_t \\ 0 & I & 0 & -[\hat{p}_t]\hat{R}_t & 0 \\ 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 \end{pmatrix} \begin{pmatrix} \xi_{R_t} \\ \xi_{v_t} \\ \xi_{p_t} \\ \zeta_t^w \\ \zeta_t^a \end{pmatrix} + \begin{pmatrix} \hat{R}_t & 0 & 0 & 0 & 0\\ [\hat{v}_t]_\times\hat{R}_t & \hat{R}_t & 0 & 0 & 0\\ [\hat{p}_t]_\times\hat{R}_t & 0 & \hat{R}_t & 0 & 0\\ 0 & 0 & 0 & -I & 0 \\ 0 & 0 & 0 & 0 & -I \end{pmatrix} \begin{pmatrix} n_t^w \\ n_t^a \\ 0 \\ n_t^{bw} \\ n_t^{ba} \end{pmatrix}$

将其写为更紧凑的形式，容易看出带有 $b i a s 的 I M U 系统与之前不带 b i a s 的 I M U$ 系统之间的联系

$\frac{d}{dt} \begin{pmatrix} \xi_t \\ \zeta_t \end{pmatrix} = \begin{pmatrix} F_{\chi_t} & F_{\chi_t, \Theta_t} \\ 0 & F_{\Theta_t} \end{pmatrix} \begin{pmatrix} \xi_t \\ \zeta_t \end{pmatrix}+ \begin{pmatrix} Ad_{\hat{\chi}_t} & 0 \\ 0 & -I \end{pmatrix} \begin{pmatrix} n_t \\ n_t^b \end{pmatrix}$

其中 $F_{\chi_{t}}$ 就是前文中不带 $b i a s$ 的 $I M U$ 系统的系统矩阵， $Ad_{\hat{\chi}_{t}}$ 是其噪声矩阵，带有 $b i a s 的 I M U$ 系统，只是在原来的系统矩阵上进行了增广。这个结论适用于所有满足本文开头所定义的系统形式。

可以看到，在这种混合误差形式下，系统矩阵不再是一个常量了，每个时刻的系统矩阵与当前状态的估计值有关。但是，这种联系只通过 $b i a s$ 或噪声等小量引入。可以证明对于这样的系统方程，一致性仍然能够保证。同时大量的实验也表明，这种“非完美”的 $I E K F$ 在鲁棒性和收敛性上仍然优于传统的 $E K F$ 。

混合状态的更新

之前已经知道，观测误差 $z_{t}$ 与状态误差 $\xi_{t}$ 的线性关系为

$\tilde{z}_t=H_t\xi_t+s_t$

其中 $s_{t}$ 是观测噪声。

现在设系统的观测量不变（即 $\Theta_{t}$ 是无法直接观测到的），但是状态误差变成了 $(\xi_t, \zeta_t)$ ，显然线性关系应该变为

$\tilde{z}_t=(H_{\chi_t}\ \ H_{\Theta_t}) \begin{pmatrix} \xi_t \\ \zeta_t \end{pmatrix}+s_t$

以带有 $b i a s 的 I M U$ 系统为例， $H_{\chi_{t}}$ 就是上文中的观测矩阵 $H_{t}$ 即

$H_{\chi_t}= \begin{pmatrix} -[m_1]_\times & 0_{3\times3} & I_{3\times3} \\ \vdots \\ -[m_k]_\times & 0_{3\times3} & I_{3\times3} \end{pmatrix}$

其中 $m_{k}$ 是观测到的 $l a n d m a r k$ 在世界系下的坐标。而观测误差与 $b i a s$ 的误差没有直接关系，即

$H_{\Theta_t}=(0_{3k\times3}\ \ 0_{3k\times3})$

同理，线性状态误差的更新变为了：

$\begin{pmatrix} \xi_t^+ \\ \zeta_t^+ \end{pmatrix}= \begin{pmatrix} \xi_t \\ \zeta_t \end{pmatrix}+ \begin{pmatrix} K_t^\xi \\ K_t^\zeta \end{pmatrix}\tilde{z}_t$

卡尔曼增益的计算都一样，即

$K_t= \begin{pmatrix} K_t^\xi \\ K_t^\zeta \end{pmatrix}=P_tH_t^T(H_tP_tH_t^T+V_t)^{-1}$

只是现在的 $H_t=(H_{\chi_t}\ \ H_{\Theta_t})$ ，状态的协方差矩阵 $P_{t}$ 维数上也有相应改变。

最终，混合状态的更新方式也采用混合的形式，能表示为群元素的部分采用指数更新，不能表示为群元素的部分采用普通的线性更新，即

$\begin{pmatrix} \hat{\chi}_t^+ \\ \hat{\Theta}_t^+ \end{pmatrix}= \begin{pmatrix} \text{Exp}(K_t^\xi\tilde{z}_t)\hat{\chi}_t \\ \hat{\Theta}_t+K_t^{\zeta}\tilde{z}_t \end{pmatrix}$

$I E K F$ 算法总结

设系统方程满足如下形式：

$\begin{aligned} \dot\Theta_t &= g(\Theta_t) \\ \dot\chi_t &= f(\chi_t, u_t, \Theta_t) \\ y_t &= h(\chi_t, \Theta_t) \end{aligned}$

其中 $\chi_t, \Theta_t$ 都是需要估计的量，但是 $\chi_{t}$ 是一个群元素 $（比如SE_k(3)中的元素），\Theta_{t}$ 是一个矢量，一般不可直接观测。则该系统在 $I E K F$ 框架下，具有如下形式的系统矩阵 $F_{t}$ 和观测矩阵 $H_{t}$ :

$F_t= \begin{pmatrix} F_{\chi_t} & F_{\chi_t, \Theta_t} \\ 0 & F_{\Theta_t} \end{pmatrix},\ \ H_t=(H_{\chi_t}\ \ H_{\Theta_t})$

且卡尔曼增益

$K_t= \begin{pmatrix} K_t^\xi \\ K_t^\zeta \end{pmatrix}=P_tH_t^T(H_tP_tH_t^T+V_t)^{-1}$

状态的更新方式为

$\begin{pmatrix} \hat{\chi}_t^+ \\ \hat{\Theta}_t^+ \end{pmatrix}= \begin{pmatrix} \text{Exp}(K_t^\xi\tilde{z}_t)\hat{\chi}_t \\ \hat{\Theta}_t+K_t^{\zeta}\tilde{z}_t \end{pmatrix}$