乏信息下基于大模型的无人艇路径规划智能体框架设计与研究

王斐; 曹建鑫; 焦锐阳; 朱修奇; 高睿

doi:10.19693/j.issn.1673-3185.04503

乏信息下基于大模型的无人艇路径规划智能体框架设计与研究

LLM-based intelligent agent framework for USV path planning in information-scarce environments

摘要

摘要:
目的针对现有无人艇路径规划算法在认知盲区、感知缺失和语义模糊等“乏信息”环境下高度依赖规则库和高精度传感输入、难以应对突发未知威胁的局限性，提出一种基于大语言模型的智能体框架，探索其在“乏信息”下进行无人艇路径规划的可行性与优势。
方法通过集成多样化工具函数库（含各类传感器调用接口）及支持自然语言交互的人机交互接口，有效拓展大模型的感知与行动边界。引入长短期记忆机制，同时设计在线与离线相结合的演进策略，实现持续学习与经验积累，使智能体能够从持续交互与经验沉淀中完成自我优化。构建高度结构化的提示工程模板，将身份定位、环境信息、当前状态、传感器观测及置信度、任务目标、思维链提示、ReAct 提示、先验常识、决策优先级和条件判断等信息系统整合为提示内容，以此高效引导并约束大模型开展环境感知、动态推理、人机交互及路径生成等工作。
结果实验结果表明，在静态障碍物环境下，智能体在路径长度、转弯次数、平均转弯角度等关键指标上均优于或不明显逊于A*，RRT，PSO和Hybrid A*等代表性传统算法，综合得分为1，高于其他传统算法（分别为0.47，0.47，0.77，0.76）；在动态环境下，大模型能够有效调用工具函数，适时请求人机交互，并基于实时感知信息执行路径规划。在面临认知盲区、感知缺失和语义模糊等复杂情境时，模型还能结合置信度评估、历史记忆及常识性知识，自适应调整规划策略，展现出卓越的规划推理能力与泛化适应能力，体现出高度类人化的智能水平。
结论所构建的智能体框架以预训练大模型为智能中枢，具备高度的模块化和可扩展性。基于大模型的无人艇路径规划智能体无论是在简单避障还是在“乏信息”等复杂场景下均具有显著优势，有望推动无人艇向更高自主性发展。

Abstract:
Objective Existing unmanned surface vehicle (USV) path planning algorithms rely heavily on rule-based libraries and high-precision sensor inputs, making them ill-suited to “information-scarce” environments characterized by cognitive blind spots, perception loss, and semantic ambiguity, and thus unable to handle sudden unknown threats. This work proposes an intelligent agent framework based on large language models (LLMs) to investigate its feasibility and advantages for USV path planning under such information-scarce conditions.
Method The proposed intelligent agent framework utilizes a pre-trained LLM as its cognitive core, and is designed for high modularity and extensibility. The framework integrates a diverse library of tool functions (including sensor APIs for real-world data acquisition) and the human-computer interaction interface for natural language engagement, which significantly extends the LLM's perceptual and operational capabilities. To facilitate continuous learning and experience accumulation, the framework incorporates long-term and short-term memory mechanisms combined with distinct online and offline evolutionary strategies. This enables the agent to autonomously self-optimize through persistent interaction and accrued experience. Central to the framework is a highly structured prompting template. This template systematically organizes critical contextual information, including the agent's persona, environmental context, current states, sensor observations with associated confidence scores, task objectives, chain-of-thought (CoT) and ReAct prompts, prior knowledge, decision-making priorities, conditional logic rules and so on. This structured prompting effectively guides and constrains the LLM's cognitive processes, enabling it to perform complex tasks such as environmental perception, dynamic reasoning, interactive communication, and goal-directed path planning.
Results Experimental results demonstrate that, in environments with static obstacles, the proposed intelligent agent outperforms or is at least comparable to representative traditional algorithms such as A*, RRT, PSO, and Hybrid A* in key metrics including path length, number of turns, and average turning angle. The agent achieved a composite performance score of 1.00, significantly higher than those of the traditional algorithms (0.47, 0.47, 0.77, and 0.76, respectively). In dynamic environments, the LLM can effectively invoke tool functions, initiate human-in-the-loop interactions when appropriate, and perform real-time path planning based on sensor inputs. When faced with challenging conditions such as cognitive blind spots, perceptual gaps, and semantic ambiguity, the agent can adapt its planning strategy by leveraging confidence assessments, historical memory, and commonsense knowledge, demonstrating robust reasoning, generalization capabilities, and human-like intelligence.
Conclusion These findings indicate that the LLM-based path planning agent exhibits significant advantages in both simple obstacle avoidance and complex, information-scarce scenarios. This approach offers strong potential to advance USVs toward higher levels of autonomy.

HTML全文

参考文献(25)

施引文献

资源附件(0)