基于多光谱单目视觉的船载海洋波浪全天候感知方法

A deep learning method for all-weather shipborne ocean wave perception based on multispectral monocular vision

  • 摘要:
    目的 针对船载视觉浪向估计在夜间低照度、局部雨雾遮挡、强反光以及船体横摇、纵摇、艏摇等动态工况下易出现纹理退化、视角扰动和估计不稳定等问题,提出一种面向全天候需求的多光谱单目视觉与惯性测量单元(IMU)姿态辅助船载浪向估计方法,并以主导浪向估计为核心任务,在有限的实船数据条件下开展方法可行性验证。
    方法 首先,构建可见光−长波红外多光谱成像与姿态协同采集系统,通过时间同步与滑动窗口构造时序样本;然后,采用CLAHE增强与U-Net分割提取海面有效区域,同时,基于ResNet-18迁移学习提取视觉特征,并结合姿态正余弦映射实现多模态融合;接着,进一步引入轻量化多头自注意力机制建模跨帧相关性,实现浪向的稳健回归;最后,在部分系统级验证实验中采用3台相互垂直布设的相机分别进行独立推理,以通过结果一致性校验与后处理融合提升输出稳定性。
    结果 结果显示,在可见光条件下,浪向估计平均绝对误差为0.41°,标准差为0.28°,成功率为99.90%;在长波红外条件下,浪向估计平均绝对误差为1.14°,标准差为0.89°,成功率为98.94%,表明长波红外成像能够在所覆盖的夜间与弱光样本中提供有效的海面纹理信息,使系统能够实现连续、稳定的浪向趋势输出。
    结论 所提方法在已采集的白天可见光、弱光/夜间红外以及有限、复杂的环境样本中表现出较好的稳定性,说明多光谱成像、姿态辅助与时序建模相结合具有一定的可行性。

     

    Abstract:
    Objective Accurate and continuous wave-direction perception is an essential prerequisite for shipborne marine environmental awareness, intelligent navigation, route optimization, and safe offshore operations. However, vision-based wave-direction estimation onboard a moving vessel remains challenging, as the observed sea-surface texture is highly susceptible to nighttime low illumination, partial rain–fog occlusion, strong specular reflections, and dynamic vessel motions such as roll, pitch, and yaw. These factors may lead to texture degradation, viewpoint disturbances, image blur, and unstable wave-direction regression. To address these challenges, this paper proposes a multispectral monocular-vision and inertial measurement unit (IMU)-assisted method for shipborne wave-direction estimation to meet all-weather sensing requirements. Dominant wave-direction estimation is treated as the core task, and the feasibility of the proposed method is validated using limited field data collected in real shipborne scenarios.
    Methods  A visible-light and long-wave infrared multispectral imaging system with synchronized attitude acquisition is first constructed. The image streams and IMU measurements are temporally aligned using timestamps, and sequential samples are generated via a sliding-window strategy. To enhance the visibility and stability of sea-surface texture under nonuniform illumination and low-contrast conditions, contrast limited adaptive histogram equalization (CLAHE) is applied to the input images. A U-Net-based segmentation network is then employed to extract the effective sea-surface region while suppressing interference from the sky, vessel structures, wake, and localized high-intensity reflections. On this basis, a ResNet-18 backbone initialized with transfer learning is used to encode visual features from each frame. The synchronized vessel attitude angles are represented using sine–cosine encoding to avoid angular discontinuities and are further embedded into the same feature space as the visual representations via a multilayer perceptron. The visual and attitude features are fused and fed into a lightweight multi-head self-attention module, which models cross-frame temporal dependencies and learns stable directional representations from evolving wave textures. The final wave direction is represented as a two-dimensional unit vector on the angular circle and is recovered using the arctangent function, thereby mitigating discontinuities near the 0°/360° boundary. In a subset of system-level validation experiments, three orthogonally arranged cameras perform independent inference. Their outputs are verified using geometric consistency constraints, and valid results are fused in post-processing to improve output stability.
    Results Field experiments were conducted in the Guangzhou–Zhuhai coastal waters. Under visible-light conditions, the proposed method achieved a mean absolute error of 0.41°, a standard deviation of 0.28°, and a success rate of 99.90% for wave-direction estimation. Under long-wave infrared conditions, the mean absolute error was 1.14°, the standard deviation was 0.89°, and the success rate was 98.94%. These results show that visible-light imagery provides clear wave-crest and wave-trough texture cues under daytime illumination, whereas long-wave infrared imaging still preserves effective sea-surface texture information under nighttime and low-illumination conditions included in this study. The multispectral configuration therefore improves the continuity of shipborne wave-direction estimation across varying illumination conditions. In addition, the incorporation of IMU-based attitude information and temporal attention modeling helps mitigate the effects of vessel motion and single-frame fluctuations, resulting in more stable wave-direction estimation over time.
    Conclusions The proposed multispectral monocular-vision–IMU multimodal temporal estimation method shows good stability on the collected visible-light daytime samples, long-wave infrared low-light/nighttime samples, and a limited set of complex-environment samples. The results indicate that the integration of multispectral imaging, attitude assistance, effective sea-surface extraction, and temporal correlation modeling is feasible for shipborne dominant wave-direction estimation. It should be noted that the current dataset does not systematically cover varying rain intensities, fog conditions, or different wind–wave coupling regimes. Therefore, the results should be regarded as a feasibility validation based on limited field data rather than a comprehensive demonstration of adaptability to all-weather operating conditions. Future work will focus on expanding the field dataset, improving model robustness under severe weather and complex sea states, and extending the framework from dominant wave-direction estimation to more comprehensive wave-parameter perception.

     

/

返回文章
返回