基于深度强化学习的多约束舰载机动态路径规划研究

Dynamic multi-constraint path planning of carrier-based aircraft based on deep reinforcement learning

  • 摘要:
    目的 现有的舰载机路径规划方法普遍忽略了舰载机转运过程中的实际空间约束,且难以满足高度动态变化的甲板环境,因此,提出一种综合考虑位姿约束和运动约束的舰载机动态路径规划算法。
    方法 首先,利用多边形法对舰载机外形进行几何建模,并基于舰载机转运速度、朝向角等参数构建舰载机运动学模型;然后,将舰载机路径规划问题建模为马尔可夫决策过程,并根据舰载机的运动特征来确定动作空间和状态空间,综合考虑位姿、安全、效率等多种因素来设计奖励函数,进而提出基于深度强化学习的舰载机路径规划算法;最后,通过仿真实验验证所提算法的有效性。
    结果 结果表明,相较于传统算法,该算法的调度时间平均减少9.2%,目标朝向角误差平均减少98.7%。
    结论 研究成果有效提高了舰载机的转运效率,可为航空母舰甲板舰载机的调运决策提供参考。

     

    Abstract:
    Objective Most existing path planning methods for carrier-based aircraft fail to account for the practical spatial constraints encountered during their transfer process and have difficulty adapting to the highly dynamic conditions on the deck. To address these limitations, this paper proposes a dynamic path planning algorithm for carrier-based aircraft that comprehensively considers pose and kinematic constraints and desired final heading angles.
    Method Initially, the geometric shape of the carrier-based aircraft is modeled using the polygon method. A kinematic model is then formulated based on parameters such as the aircraft's movement speed and heading angle. Subsequently, the path planning problem for the carrier-based aircraft is formulated as a Markov decision process (MDP). The action and state spaces are defined based on the aircraft's motion characteristics. A reward function is designed by incorporating factors such as pose, orientation, safety, and efficiency. A deep reinforcement learning-based path planning algorithm for carrier-based aircraft is then proposed. Finally, simulations are conducted to validate the effectiveness of the proposed algorithm.
    Results The results demonstrate that, compared to traditional algorithms, the proposed algorithm reduces scheduling time by an average of 9.2% and decreases target heading angle error by an average of 98.7%.
    Conclusion The proposed method effectively improves the transfer efficiency of carrier-based aircraft and provides valuable insights for handling decision in aircraft coordination and deck operations.

     

/

返回文章
返回