现代制造工程 ›› 2026, Vol. 547 ›› Issue (4): 61-69.doi: 10.16731/j.cnki.1671-3133.2026.04.008

• 车辆工程制造技术 • 上一篇    下一篇

基于RL-MPC的智能车辆轨迹跟踪横向控制策略*

郝亮1, 董耀付1, 刘磊1, 杨少华2   

  1. 1 辽宁工业大学,锦州 121000;
    2 中国人民解放军火箭军装备部驻呼和浩特地区军事代表室,呼和浩特 010000
  • 收稿日期:2025-05-30 发布日期:2026-05-07
  • 通讯作者: 郝亮,博士,教授,硕士研究生导师,主要研究方向为特种车辆系统动力学及控制、新能源汽车集成控制等。E-mail:hl867438249@126.com
  • 作者简介:董耀付,硕士研究生,主要研究方向为智能网联汽车。刘磊,博士,教授,主要研究方向为智能控制与无人系统。E-mail:1183321896@qq.com
  • 基金资助:
    *国家自然科学基金区域创新发展联合基金重点项目(U24A20283);辽宁省科技厅计划联合计划项目(技术攻关计划项目)(2024JH2/102600150);辽宁省科技厅成果转化类揭榜挂帅项目(2023JHI/11100003)

Lateral control strategy for intelligent vehicle trajectory tracking based on RL-MPC

HAO Liang1, DONG Yaofu1, LIU Lei1, YANG Shaohua2   

  1. 1 Liaoning University of Technology,Jinzhou 121000,China;
    2 Military Representative Office of the Equipment Department of the Rocket Force of the Chinese People′s Liberation Army in Hohhot Area,Hohhot 010000,China
  • Received:2025-05-30 Published:2026-05-07

摘要: 针对智能车辆在横向轨迹跟踪过程中难以兼顾轨迹跟踪精度和行驶稳定性的问题,提出了一种基于强化学习与模型预测控制(Reinforcement Learning with Model Predictive Control,RL-MPC)的智能车辆轨迹跟踪横向控制策略。首先,构建车辆系统动力学模型;然后,依托模型预测控制(Model Predictive Control,MPC)的滚动优化框架实时量化轨迹跟踪精度,同时构建奖励函数,实现控制策略的自主优化,并通过反馈校正模块修正预测偏差;最后,通过Matlab/Simulink软件搭建仿真模型,进行仿真实验,并将RL-MPC控制器与传统MPC控制器进行对比,实验结果表明,在不同工况下,相较传统MPC控制器,RL-MPC控制器对轨迹的跟踪效果更好,并且显著降低了横向轨迹跟踪误差,提升了行驶工况下的操纵稳定性。

关键词: 智能车辆, 轨迹跟踪, 横向控制, 模型预测控制, 强化学习

Abstract: To address the problem that intelligent vehicles struggle to balance trajectory tracking accuracy and driving stability during lateral trajectory tracking,a lateral control strategy for intelligent vehicle trajectory tracking based on Reinforcement Learning combined with Model Predictive Control (RL-MPC) is proposed. First,a vehicle system dynamics model is established. Then,relying on the receding horizon optimization framework of Model Predictive Control (MPC),the trajectory tracking accuracy is quantified in real time. Meanwhile,a reward function is constructed to realize the autonomous optimization of the control strategy,and a feedback correction module is used to correct the prediction deviation. Finally,a simulation model is built using Matlab/Simulink software for simulation tests,and the RL-MPC controller is compared with the traditional MPC controller. The experimental results show that under different working conditions,compared with the traditional MPC controller,the RL-MPC controller achieves better trajectory tracking performance,significantly reduces the lateral tracking error,and improves the handling stability under driving conditions.

Key words: intelligent vehicles, trajectory tracking, lateral control, Model Predictive Control (MPC), Reinforcement Learning (RL)

中图分类号: 

版权所有 © 《现代制造工程》编辑部 
地址:北京市东城区东四块玉南街28号 邮编:100061 电话:010-67126028 电子信箱:2645173083@qq.com
本系统由北京玛格泰克科技发展有限公司设计开发 技术支持:support@magtech.com.cn