基于人工势场法改进MADDPG算法的AUV协同应召搜潜航路规划研究

On-Call Antisubmarine Path Planning for AUVs Based on an Artificial Potential Field-Enhanced MADDPG Algorithm

  • 摘要: 摘 要:【目的】为提高AUV在复杂水下环境中的协同探测效率和稳定性,提出了一种基于人工势场法改进MADDPG算法的AUV协同应召搜潜航路规划模型【方法】针对搜潜路径规划中使用APF容易局部最优;MADDPG算法前期盲目探索,收敛性差的问题,提出使用人工势场法的引力场引导AUV前期运动方向并与MADDPG结合的算法(APF-MADDPG)。通过蒙特卡洛方法仿真大量目标可能轨迹,统计所有目标轨迹点不同时刻所在的海域位置,进而实现预测动态水下目标的散布规律。同时,综合考虑声呐不同距离的探测概率与累积探测概率公式结合作为路径评估指标,采用该算法实现双AUV与三AUV的协同探测仿真。【结果】仿真实验结果显示,APF-MADDPG算法在双AUV协同探测场景中相比原始MADDPG将累积探测概率(CDP)提高了7个百分点,达到80.93%,在三AUV协同探测场景中提升了0.6个百分点,达到92.67%。【结论】APF-MADDPG算法有效地提升了AUV协同搜潜任务的探测效率和稳定性,对于提高AUV在水下环境中的协同作战能力具有参考价值。未来研究可以进一步探索其他深度强化学习算法在同一搜潜场景下的性能对比,以进一步提升搜潜场景下多AUV协同的探测效率与协同作战能力。

     

    Abstract: Abstract:Objectives To enhance the efficiency and stability of cooperative AUV detection in complex underwater environments, a cooperative search and detection path planning model for AUVs based on an improved MADDPG algorithm with the Artificial Potential Field (APF) method is proposed. Methods To address the issues of local optima in APF-based path planning and poor convergence due to blind exploration in the early stages of MADDPG, the proposed APF-MADDPG algorithm integrates the gravitational field of the APF to guide the initial movement of the AUVs. A Monte Carlo method is used to simulate multiple possible target trajectories and statistically analyze the distribution of target positions at different times to predict the dispersion pattern of dynamic underwater targets. Additionally, the detection probability at various sonar ranges and the cumulative detection probability (CDP) formula are used as path evaluation metrics. The proposed algorithm is employed to simulate cooperative detection scenarios involving two and three AUVs. Results Simulation results show that the APF-MADDPG algorithm improves the cumulative detection probability (CDP) by 7 percentage points, reaching 80.93% in the two-AUV cooperative detection scenario, and by 0.6 percentage points, reaching 92.67%, in the three-AUV scenario, compared to the original MADDPG. Conclusions The APF-MADDPG algorithm effectively enhances the detection efficiency and stability in AUV cooperative search and detection tasks, providing valuable insights for improving the cooperative operational capabilities of AUVs in underwater environments. Future research could further explore the performance of other deep reinforcement learning algorithms in similar search and detection scenarios to further improve the efficiency and cooperative capabilities of multi-AUV systems.

     

/

返回文章
返回