乏信息下基于大模型的无人艇路径规划智能体框架设计与研究

王斐; 曹建鑫; 焦锐阳; 朱修奇; 高睿

doi:10.19693/j.issn.1673-3185.04503

乏信息下基于大模型的无人艇路径规划智能体框架设计与研究

LLM-based intelligent agent framework for USV path planning in information-scarce environments

摘要

摘要:
目的针对现有无人艇路径规划算法在认知盲区、感知缺失和语义模糊等“乏信息”环境下高度依赖规则库和高精度传感输入，难以应对突发未知威胁的局限性，提出一种基于大语言模型的智能体框架，探索其在“乏信息”下进行无人艇路径规划的可行性与优势。
方法通过集成多样化工具函数库（含各类传感器调用接口）及支持自然语言交互的人机交互接口，有效拓展大模型的感知与行动边界。引入长短期记忆机制，同时设计在线与离线相结合的演进策略，实现持续学习与经验积累，使智能体能够从持续交互与经验沉淀中完成自我优化。构建高度结构化的提示工程模板，将身份定位、环境信息、当前状态、传感器观测及置信度、任务目标、思维链提示、ReAct 提示、先验常识、决策优先级和条件判断等信息系统整合为提示内容，以此高效引导并约束大模型开展环境感知、动态推理、人机交互及路径生成等工作。
结果实验结果表明：在静态障碍物环境下，智能体在路径长度、转弯次数、平均转弯角度等关键指标上均优于或不明显逊于A*，RRT，PSO和Hybrid A*等代表性传统算法，综合得分为1，高于其他传统算法（分别为0.47，0.47，0.77，0.76）；在动态障碍物环境下，大模型能够有效调用工具函数，适时请求人机交互，并基于实时感知信息执行路径规划。在面临认知盲区、感知缺失和语义模糊等复杂情境时，模型还能结合置信度评估、历史记忆及常识性知识，自适应调整规划策略，展现出卓越的规划推理能力与泛化适应能力，体现出高度类人化的智能水平。
结论所构建的智能体框架以预训练大模型为智能中枢，具备高度的模块化和可扩展性。基于大模型的无人艇路径规划智能体无论是在简单避障还是在“乏信息”等复杂场景下均具有显著优势，有望推动无人艇向更高自主性发展。

Abstract:
Objective Existing unmanned surface vehicle (USV) path planning algorithms rely heavily on rule-based libraries and high-precision sensor inputs, making them unsuitable for “information-scarce” environments characterized by cognitive blind spots, perception loss, and semantic ambiguity. Such conditions hinder their ability to respond to sudden, unknown threats. This study proposes an intelligent agent framework leveraging large language models (LLMs) to explore its feasibility and advantages for USV path planning under such information-scarce conditions.
Method The proposed intelligent agent framework leverages a pre-trained LLM as its cognitive core, and is designed for high modularity and extensibility. The framework integrates a diverse library of tool functions (including sensor APIs for real-world data acquisition) and a human-computer interaction interface for natural language engagement, significantly enhancing the LLM's perceptual and operational capabilities. To facilitate continuous learning and experience accumulation, the framework incorporates long-term and short-term memory mechanisms alongside distinct online and offline evolutionary strategies, enabling the agent to autonomously self-optimize through persistent interactions and accrued experience. At the core of the framework is a highly structured prompting template, which systematically organizes critical contextual information such as the agent's persona, environmental context, current states, sensor observations with confidence scores, task objectives, chain-of-thought (CoT) and ReAct prompts, prior knowledge, decision-making priorities, and conditional logic rules. This structured prompting effectively guides and constrains the LLM's cognitive processes, enabling it to perform complex tasks including environmental perception, dynamic reasoning, interactive communication, and goal-directed path planning.
Results Experimental results indicate that, in environments with static obstacles, the proposed intelligent agent outperforms or performs at least comparably to representative traditional algorithms such as A*, RRT, PSO, and Hybrid A* across key metrics including path length, number of turns, and average turning angle. The agent achieved a composite performance score of 1.00, significantly higher than those of the traditional algorithms (0.47, 0.47, 0.77, and 0.76, respectively). In dynamic environments, the LLM effectively invokes tool functions, initiates human-in-the-loop interactions when appropriate, and performs real-time path planning based on sensor inputs. Under challenging conditions such as cognitive blind spots, perceptual gaps, and semantic ambiguity, the agent adapts its planning strategy by leveraging confidence assessments, historical memory, and commonsense knowledge, demonstrating robust reasoning, generalization capabilities, and human-like intelligence.
Conclusion These findings indicate that the LLM-based path planning agent exhibits significant advantages in both simple obstacle avoidance and complex, information-scarce scenarios. This approach holds strong potential for advancing USVs toward higher levels of autonomy.

HTML全文

参考文献(25)

施引文献

资源附件(0)