Vehicle Lateral Optimal Control【Autonomous Vehicle Planning and Control】

Jay-Wang77

已于 2024-04-06 11:51:10 修改

阅读量773

点赞数 19

文章标签： c++

于 2024-04-03 15:30:34 首次发布

本文链接：https://blog.csdn.net/Seinlvan/article/details/137278220

版权

Vehicle Lateral Optimal Control

Linear Quadratic Regulator (LQR)

代码实现参考： link

Linear Quadratic Regulator (LQR)

A system is described by the standard linear state space model:

$\dot{x} = Ax + Bu \\ y = Cx$

The objective is to bring the non-zero initial state to zero in the infinite time horizon.

The cost function takes the quadratic form:

$\frac{1}{2} \int_{0}^{\infty} (x^T Qx + u^T Ru)dt$

(The advantage of using the quadratic form: avoid negative numbers in the area when the state x is negative; a minimum value can always be found)

$\begin{bmatrix} q_1 & 0 & \ldots & 0 \\ 0 & q_2 & \ldots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \ldots & q_n \end{bmatrix}$

And ( R ) is:

$\begin{bmatrix} r_1 & 0 & \ldots & 0 \\ 0 & r_2 & \ldots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \ldots & r_m \end{bmatrix}$

Note from $x^TQx = q_{1}x_{1}^2 + q_{2}x_{2}^2 + \ldots + q_{n}x_{n}^2$ , where ( Q ) is defined as a matrix with diagonal elements ( q_i ). Where $q_i \geq 0$ , for ( i = 1, 2, … , n ) and $r_i > 0$ , for ( i = 1, 2, … , m ).

( q_i ) are relative weightings among ( x_i ).
If ( q_1 ) is bigger than ( q_2 ), there is a higher penalty/price on error in ( x_1 ) than in ( x_2 ), and control will try to make ( x_1 ) smaller than ( x_2 ), vice versa. (Pay more attention to ( q_1 ))

The same principle applies to $u^TRu = r_1u_1^2 + r_2u_2^2 + \ldots + r_mu_m^2$ .

LQR General Solution (Raccati Equation)

For a LQR problem defined as

System:
$\dot{x} = Ax + Bu \\ y = Cx$

State feedback:
$u = - K x$

The closed loop system is:
$\dot{x} = (A - BK)x = A_{cl}x .$

Cost function:
$\frac{1}{2} \int_{0}^{\infty} (x^T Qx + u^T Ru) dt$

Control law:
$u = -Kx = -R^{-1}B^TPx,$
will bring ( x(\infty) ) to 0 and ( u(\infty) ) to 0

where
$A^TP + PA + Q = PBR^{-1}B^TP$

that is, the optimal control law is a linear feedback of the state vector ( x ), as assumed.

Dynamic Model of Lateral Vehicle Motion

“Bicycle” dynamic model:

$a_y = \left(\frac{d^2y}{dt^2}\right)_{\text{inertial}} = \dot{v}_y + v_x\dot{\psi}$

$F'_{yf} + F'_{yr} = m a_y = m(\dot{v}_y + v_x \ddot{\psi})$

$l_f F'_{yf} - l_r F'_{yr} = I_z \dot{\psi}$

after considering the steering angle δ .

$(F_{yf} \cos(\delta) - F_{xf} \sin(\delta)) + F_{yr} = m(\dot{v}_y + v_x r)$

$l_f (F_{yf} \cos(\delta) - F_{xf} \sin(\delta)) - l_r F_{yr} = I_z \dot{\psi} = I_z r$

Front and Rear Tire Forces:

$F_{yf} = c_f\alpha_f = c_f(\delta - \theta_{vf})$

$F_{yr} = c_r\alpha_r = c_r(-\theta_{vr})$

$\theta_{vf} = \tan^{-1}\left(\frac{v_{yf}}{v_{xf}}\right) = \tan^{-1}\left(\frac{v_y + l_f r}{v_x}\right)$

$\theta_{vr} = \tan^{-1}\left(\frac{v_{yr}}{v_{xr}}\right) = \tan^{-1}\left(\frac{v_y - l_r r}{v_x}\right)$

Lateral and Yaw Dynamics:

$\dot{v}_y = \frac{c_f \left[ \delta - \tan^{-1}\left(\frac{v_y + {r}l_f}{v_x}\right) \right] \cos(\delta) - c_r\tan^{-1}\left(\frac{v_y - {r}l_r}{v_x}\right) - F_{xf}\sin(\delta)}{m} - v_xr$

$\dot{r} = \frac{c_f l_f \left[ \delta - \tan^{-1}\left(\frac{v_y + {r}l_f}{v_x}\right) \right] \cos(\delta) + c_r l_r \tan^{-1}\left(\frac{v_y - {r}l_r}{v_x}\right) - l_f F_{xf}\sin(\delta)}{I_z}$

after small angle assumptions and re-group by variables:
$\cos(\delta) \approx 1$

$\sin(\delta) \approx 0$

$\tan^{-1}(\delta) \approx \delta$

$\dot{v}_y = -\frac{(c_f + c_r)}{mv_x}v_y + \left[\frac{(l_rc_r - l_fc_f)}{mv_x}\right]v_x r + \frac{c_f}{m}\delta$

$\dot{r} = \frac{l_f c_f - l_r c_r}{I_z v_x}v_y + \left[-\frac{(c_f l_f^2 + c_r l_r^2)}{I_z v_x}\right]r + \frac{l_f c_f}{I_z}\delta$
请添加图片描述

Dynamic Bicycle Model

#### Linearized Dynamic Model of Lateral Vehicle Motion

If we use state
$\mathbf{X} = \begin{bmatrix} y\\ v_y \\ \psi \\ \dot{\psi} \end{bmatrix} \ and \ input \ \delta \ ,$
rewrite in state space model:
$\mathbf{\dot{X}} = \mathbf{A}\mathbf{X} + \mathbf{B}\delta \ ,$
it is

$\frac{d}{dt}\begin{bmatrix} y\\ \ v_y \\ \psi \\ \dot{\psi} \end{bmatrix} = \begin{bmatrix} 0 & 1 & 0 & 0 \\ 0 & -\frac{(c_f + c_r)}{m v_x} & 0 & \frac{(l_r c_r - l_f c_f)}{m v_x} - v_x \\ 0 & 0 & 0 & 1 \\ 0 & \frac{l_f c_f - l_r c_r}{I_z v_x} & 0 & -\frac{(c_f l_r^2 + c_r l_f^2)}{I_z v_x} \end{bmatrix} \begin{bmatrix} y \\ v_y \\ \psi \\ \dot{\psi} \end{bmatrix} + \begin{bmatrix} 0 \\ \frac{c_f}{m} \\ 0 \\ \frac{l_f c_f}{I_z} \end{bmatrix} \delta$

Trajectory tracking with LQR

Path Coordinates Model (Error dynamics)

For path tracking, it is useful to express the bicycle model with respect to the path function of its length 𝑠 and with the constant longitudinal velocity assumption.
We can choose
$\begin{bmatrix} e_{cg} & \dot{e_{cg}} & e_{\theta} & \dot{e_{\theta}} \end{bmatrix}^T$

as our system state and $\delta$ .

$e_{cg}$ : Orthogonal distance of the C.G. to the nearest path waypoint;
$\dot{e_{cg}}$ : Relative speed between vehicle C.G and path;
$e_{\theta}$ : Heading/Yaw difference between vehicle and path, $e_{\theta} = \theta - \theta_p(s)$
$\dot{e_{\theta}}$ : Relative yaw rate between vehicle C.G and path, $e_{\theta} = r - r(s)$ where $\dot{\theta(s)}$ is the yaw rate derived from the path.

在这里插入图片描述

Dynamic Bicycle Model in path coordinates

With the constant longitudinal velocity assumption,

$\dot{e}_{cg} = v_y + v_x \tan(\theta - \beta_p(s)) \\= v_y + v_x \tan(e_{\theta})$

Thus, the acceleration of C.G. is:
$\dot{e}_{cg} = (\dot{v}_y + v_xr') - \dot{v}_y(s) \\ = \dot{v}_y + v_x(r - r(s)) \\= \dot{v}_y + v_x\dot{e}_\theta.$

Convert lateral dynamic to error dynamics:
$v_y = \dot{e}_{cg} - v_x\sin(e_\theta) \\ \dot{v}_y = \dot{e}_{cg} - v_x\dot{e}_\theta \\ \theta = e_\theta + \theta_p (s) \\ r = \dot{e}_\theta + r(s) \\ \dot{r} = \ddot{e}_\theta + \dot{r}(s)$

Now, we have the linear lateral dynamic model. Rewrite it as:
$\dot{x} = Ax + B_1\delta + B_2 \ r_{des}, \text{ where } x = (e_{cg} \, \dot{e}_{cg} \, e_\theta \,\dot{e}_\theta)^T.$

$\begin{bmatrix} 0 & 1 & 0 & 0 \\ 0 & -\frac{(c_f + c_r)}{mv_x} & \frac{c_f + c_r}{m} & \frac{(l_r c_r - l_f c_f)}{mv_x} \\ 0 & 0 & 1 & 0 \\ 0& \frac{l_r c_r - l_f c_f}{I_z v_x} & \frac{l_r c_r - l_f c_f}{I_z} & -\frac{(c_f l_f^2 + c_r l_r^2)}{I_z v_x} & 0 \\ \end{bmatrix} , B_1 = \begin{bmatrix} 0 \\ \frac{c_f}{m} \\ 0 \\ \frac{l_f c_f}{I_z} \\ \end{bmatrix} , B_2 = \begin{bmatrix} 0 \\ -\frac{l_f c_r - l_r c_f}{m v} -v\\ 0 \\ -\frac{l_f^2 c_f + l_r^2 c_r}{I_z v} \\ \end{bmatrix}$

Basic work flow:

Check the controllability matrix has full rank: $B_1, A B_1, A^2 B_1, A^3 B_1]$ .
Convert the continuous time system to discrete time.
$A_dx(k) + B_{1d}\delta(k) + B_{2d}\ r_{des}(k)$
Use the full state feedback law: $\delta = -Kx = -k_1 e_{cg} - k_2 \dot{e}_{cg} - k_3 e_\theta - k_4 \dot{e}_\theta.$
Add feedforward control to calculate the steering angle theoretically required to drive smoothly along the trajectory without any deviation: $\delta_s = {\delta_f} -Kx = atan(\rho*(l_r + l_f)) -k_1 e_{cg} - k_2 \dot{e}_{cg} - k_3 e_\theta - k_4 \dot{e}_\theta.$

Apply LQR in this situation, we have
$\delta^*(k) = -Kx(k) + {\delta_f}^*(k)$
Where $K = (R + B_d^T P B_d)^{-1} B_d^T P A_d.$

Objective cost function to be minimized by the control is
$\sum_{k=0}^{\infty} x(k)^T Q x(k) + \delta(k)^T R \delta(k)$

where P satisfies the matrix difference Riccati equation
$P = A_d^T P A_d - A_d^T P B_d(R + B_d^T P B_d)^{-1} B_d^T P A_d + Q$

(The P matrix provides an estimate of the cost required to reach the target state for a given state. It reflects the performance index in the process of transferring the system state to the target state.)

LQR tuning

Let $\begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}$ , we choose different ( R ):

When ( R = 10 ):
- Penalty on control effort is large → Less control effort is used → slower response.
When ( R = 1 ):
- Penalty on control effort is small → More control effort is used → faster response.

Position error response to initial condition

在这里插入图片描述

Control effort

Let ( R = 1 ), we choose different ( Q ):

When $\begin{bmatrix} 10 & 0 \\ 0 & 1 \end{bmatrix}$ :
- Penalty on position error is greater than penalty on speed error → prioritizes minimizing position error → may result in a faster response in position tracking.
When $\begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}$ :
- Penalty on position error is equal to penalty on speed error → balanced approach → may result in a response that does not prioritize one over the other.

Position error response to initial condition

在这里插入图片描述

Speed error response to initial condition

代码框架

LqrController

Properties
- Time step (ts_)
- Front and rear wheel cornering stiffness (cf_, cr_)
- Wheelbase and steer ratio (wheelbase_, steer_ratio_)
- Maximum steering angle (steer_single_direction_max_degree_)
- Vehicle mass and inertia (mass_, iz_)
- Front and rear distances from the center of mass (lf_, lr_)
- LQR algorithm parameters (lqr_eps_, lqr_max_iteration_)
Initialization
- Load vehicle and LQR configuration (LoadControlConf)
- Initialize matrices and control parameters (Init)
Control Logic
- Update vehicle state and compute control command (ComputeControlCommand)
- State Update
  - Compute lateral errors (ComputeLateralErrors)
  - Update state matrix (UpdateState)
- Matrix Updates
  - Update the state matrix A and discretize the state matrix A (UpdateMatrix)
- Control Computation
  - Solve LQR problem to obtain control gains (SolveLQRProblem)
  - Compute feedback and feedforward components (ComputeFeedForward)
  - Calculate the final steering angle
Utilities
- Normalize angles and calculate distances (NormalizeAngle, PointDistanceSquare)
- Query the nearest trajectory point (QueryNearestPointByPosition)

LoadControlConf()

加载控制器配置，设置车辆动力学参数。如：侧偏刚度，时间步，前后悬质量，前后轴长度，惯性矩，LQR迭代准确性和迭代次数。

Init()

初始化控制器，构建状态空间模型（A, B矩阵）和成本函数（Q, R矩阵）。

ComputeControlCommand(…)

计算并输出控制命令，是控制器的核心逻辑部分。

配置A, B矩阵
通过UpdateState()更新状态和计算横向误差

UpdateState(const VehicleState &vehicle_state)

根据车辆当前状态更新状态向量，并通过ComputeLateralErrors(…)计算横向误差。

UpdateMatrix(const VehicleState &vehicle_state)

更新状态矩阵A并将状态矩阵A离散化。

ComputeLateralErrors(…): 计算并更新四个状态量

SolveLQRProblem(matrix_ad_, matrix_bd_, matrix_q_, matrix_r_, lqr_eps_, lqr_max_iteration_, &matrix_k_)

实现LQR算法，计算最优反馈增益矩阵K。
输入参数

Matrix &A, &B：系统的动态矩阵，用于描述车辆的动态行为。
Matrix &Q, &R：权重矩阵，分别用于量化状态偏差和控制输入的成本。
double tolerance：算法收敛的容忍度。
max_num_iteration：算法的最大迭代次数。
Matrix *ptr_K：指向计算出的反馈增益矩阵K 的指针。

steer_angle_feedback

calculate feedback, steer = -K * state.

steer_angle_feedforward

计算理论上需要的转向角度，以便在没有任何偏差时沿轨迹平稳行驶。

ComputeFeedForward (const VehicleState &localization, double ref_curvature): 根据给定的 ref_curvature和车辆的轴距来计算前馈转向角，转向角度等于车辆轴距与曲率的乘积的反正切。

Jay-Wang77

关注

19
点赞
踩
22

收藏

觉得还不错? 一键收藏
1
评论
Vehicle Lateral Optimal Control【Autonomous Vehicle Planning and Control】

探索车辆横向最优控制：Linear Quadratic Regulator (LQR)专为处理车辆的横向运动设计。通过清晰的数学模型和直观的动态系统分析，揭示了LQR如何优化车辆的轨迹跟踪，从标准的线性状态空间模型出发，详细阐述了成本函数的二次形式如何帮助LQR算法在无限时间范围内，将非零初始状态平滑调整至目标状态。展示了如何调整控制力度和状态误差的相对重要性，实现对车辆横向动态的精细控制。还探讨了“自行车”动态模型如何应用于LQR，以及如何通过Riccati方程求解最优控制律。
复制链接

扫一扫