基于数字孪生的水面多无人智能体协同巡航策略优化方法

Method for optimizing cooperative cruising strategy of surface unmanned multi-agent based on digital twin

  • 摘要:
    目的 针对水面环境上多无人智能体协同自主性和协调性不足的问题,提出一种基于数字孪生的多无人智能体协同巡航策略优化方法。
    方法 首先,根据多无人智能体物理实体来构建数字孪生模型,建立多无人智能体协同巡航的数学模型,并分析多无人智能体协同过程的运动特点;然后,针对多智能体的相互影响和协作关系,采用多智能体近端策略优化算法进行多智能体的策略优化,以提升多无人智能体之间的协同巡航效率;最后,通过搭建的水面多无人智能体的数字孪生模型,对多智能体训练效果进行验证。
    结果 与多智能体深度确定性策略梯度(MADDPG)算法相比,所提出的多智能体近端策略优化(MAPPO)算法的收敛稳定平均奖励值提升了14.7%,无人智能体能够以巡航目标为中心而均匀分布,从而提供更全面的协同巡航信息。
    结论 研究成果可为水面多无人智能体协同巡航策略优化提供理论与实践参考。

     

    Abstract:
    Objectives In view of the problem of insufficient autonomy and coordination of unmanned multi-agent collaborating on the surface environment, a digital twin-based multi-unmanned agent collaborative cruising strategy optimization method is proposed.
    Methods Firstly, according to the construction of digital twin model of unmanned multi-agent physical entity, the mathematical model of unmanned multi-agent collaborative cruising is established to analyze the motion characteristics of the collaborative process of unmanned multi-agent. Then, aiming at the mutual influence and collaborative relationship of unmanned multi-agent, the unmanned multi-agent proximal strategy optimization algorithm is adopted to carry out the strategy optimization of unmanned multi-agent, and improve the cooperative cruising efficiency among unmanned multi-agent. Finally, the proposed water-surface multi-agent digital twin model is utilized to validate the training effectiveness of multiple autonomous agents.
    Results Compared with multi-agent deep deterministic policy gradient (MADDPG), the proposed (muti-agent proximal policy optimization, MAPPO) algorithm achieves a 14.7% improvement in average reward and demonstrates more stable convergence. The unmanned agents are capable of forming a uniformly distributed formation centered around the patrol target, thereby providing more comprehensive information for coordinated cruising.
    Conclusions The study provides significant theoretical and practical support for the research on the optimization of cooperative cruising strategies of unmanned multi-agent on the water surface.

     

/

返回文章
返回