基于强化学习的成品油船装载方案自主生成技术研究

尼洪涛; 周清基; 柴松; 齐鸣

doi:10.19693/j.issn.1673-3185.03474

基于强化学习的成品油船装载方案自主生成技术研究

Reinforcement learning-based autonomous generation technology of loading scheme for product oil tanker

摘要

摘要:
目的旨在基于强化学习方法研究液货舱装载方案自主生成技术。
方法以实际运营的成品油船载货量作为输入，以货舱及压载舱的装载率为目标，基于Unity ML-Agents构建智能体与环境，通过PyTorch框架对智能体进行训练，提出一种综合考虑装载时间与纵倾变化幅度的奖励函数计算方法，并以算例分析来验证所提方法的有效性。
结果结果显示，所训练的智能体能够学习良好的策略，并实现液货舱装载方案的自主生成。
结论研究结果表明，将强化学习用于解决多目标条件下的液货舱装载方案自主生成是合理可行的。

Abstract:
Objectives This paper aims to study the automatic generation technology of loading and unloading schemes for the liquid cargo tank of oil tanker based on reinforcement learning.
Methods Using the cargo capacity of an actual operating oil tanker as input and the loading rates of the cargo tank and ballast water tank as the targets, an intelligent agent and environment were built based on Unity ML-Agents. The agent was trained using the PyTorch framework, and a reward function calculation method that comprehensively considers the loading time and the change in the trim amplitude was proposed. Finally, the example analysis were carried out to validate the validity of the proposed method.
Results The results show that, the trained agent can learn good strategies and achieve autonomous generation of liquid cargo tank loading schemes.
Conclusions The study showed that applying reinforcement learning to solve the problem of autonomous generation of liquid cargo tank loading schemes under multi-objective conditions is reasonable and feasible.

HTML全文

参考文献(15)

施引文献

资源附件(0)