混行环境下的船舶可交互式博弈避碰决策

Ship interactive game collision avoidance decision-making in mixed environment

  • 摘要: 【目的】针对混行环境下自主船舶与传统有人驾驶船舶难以有效沟通避让意图的问题,提出一种基于主从博弈和思维链思想的可交互式避碰决策方法。【方法】首先对混行环境下的船舶避碰场景进行假设描述,将自主船舶与传统有人驾驶船舶建模为领导者-跟随者的主从博弈模型,从航海实践的角度设计了策略空间与收益函数,其次考虑船间的交互过程,设计了一种包含状态感知、意图共享、策略协商和避碰决策四个子模块的思维链可交互式避碰决策算法(Chain of Thought-Game collision avoidance,COT-GCA),最后采取三船和四船两组会遇局面进行实验验证。【结果】结果显示,两组实验中船舶均能高效理解它船避让意图并安全避碰,且避碰行为的响应、转向幅度和复航体现出及早性、大幅性和稳定性,决策单元评价方法计算船舶决策前后的产出效率评价均值分别为1和0.993,接近最优,能够解释该博弈模型在解决船舶交互避碰上的高效能力。【结论】所提模型算法能够有效提高混行环境下船舶的交互避让决策能力,为未来实际应用提供理论意义。

     

    Abstract: Objectives Aiming at the problem that it is difficult to effectively communicate the avoidance intention between autonomous ships and traditional ships in mixed environment, an interactive collision avoidance decision-making method based on Stackelberg game and Chain of Thought is proposed. Methods Firstly, the scenario of ship collision avoidance in mixed environment is described. The autonomous ship and the traditional ship are modeled as a leader-follower Stackelberg game model. The reward functions are designed from the perspective of navigation practice. Secondly, considering the interaction process between ships, a chain of thought-Game collision avoidance decision algorithm ( COT-GCA ) is designed, which includes four sub-modules : state perception, intention sharing, strategy negotiation and collision avoidance decision. Finally, the three-ship and four-ship two-group encounter situations were used for experimental verification. Results The results show that the ships in the two groups of experiments can efficiently understand the avoidance intention of other ships and avoid collision safely, and the response, steering range and resumption of collision avoidance behavior reflect the earlyness, sharpness and stability. The average value of the output efficiency evaluation before and after the decision-making of the decision-making unit evaluation method is 1 and 0.993 respectively, which is close to the optimal, and can explain the high efficiency of the game model in solving the ship 's interactive collision avoidance. Conclusions The proposed model and algorithm can effectively improve the interactive avoidance decision-making ability of ships in mixed environment, and provide theoretical significance for future practical applications.

     

/

返回文章
返回