基于局部状态感知的无人艇深度强化学习路径规划

Deep Reinforcement Learning Path Planning for Unmanned Surface Vehicle Based on Local Observation

  • 摘要: 【目的】针对海上救援任务中无人艇感知范围有限导致的路径规划效率低、鲁棒性差的问题,提出一种基于局部状态感知的无人艇路径规划方法。【方法】首先,采用Soft Actor-Critic算法,设计了基于局部感知的奖励函数,并结合特征增强训练方法,通过提取环境关键特征并在随机特征环境中训练,提升了有限感知条件下的路径规划采样效率和鲁棒性。此外,提出一种基于局部感知域的迭代航路点规划方法,有效协调局部与全局目标,最终实现在海上救援任务中的高效路径规划。【结果】仿真结果显示,所提出方法在特征环境中路径规划成功率达到98%以上,且在海上救援任务中完成率达到93%以上,表现出对不确定环境较好的鲁棒性和适应性。【结论】所提出的基于局部状态感知的无人艇路径规划方法解决了深度强化学习在海上救援任务中应用问题,为强化学习算法在实际工程中应用提供技术支持。

     

    Abstract: Objectives To address the issues of low path planning efficiency and poor robustness caused by the limited sensing range in maritime rescue missions., a local observation-based path planning method for unmanned surface vehicle is proposed. MethodsThe Soft Actor-Critic algorithm is employed, with a reward function based on local observation designed and combined with a feature-enhanced training method. By extracting key environmental features and training in a randomized feature environment, the path planning sampling efficiency and robustness under limited perception conditions are improved. An adaptive waypoint planning algorithm based on local perception domains is developed, coordinating local and global planning for efficient path planning. Results Simulation results show the proposed method achieves a success rate of over 98% in feature environments and a completion rate of over 93% in rescue missions, demonstrating strong robustness and adaptability. Conclusions The proposed local observation path planning method for unmanned surface vehicle addresses challenges in maritime rescue missions and provides technical support for applying reinforcement learning in real-world scenarios.

     

/

返回文章
返回