大小模型协同驱动的舰载机起降特情诱因生成方法

A large-small model collaboration-driven method for generating special situation in carrier aircraft takeoff and landing

  • 摘要:
    目的 舰载机起降特情诱因具有样本稀缺、难以预知等特点,为缓解大模型在舰载机起降特情诱因生成场景中的幻觉问题,提出大小模型协同驱动的舰载机起降特情诱因生成方法(SGCAD)。
    方法 首先,基于专业文献与检索增强技术构建舰载机起降知识库,形成关于舰载机正常起降描述的数据集;然后,以正常起降描述作为模板,结合场景提示词,采用大模型生成特情诱因,并借助小模型区分合理与不合理的特情诱因;最后,结合直接偏好优化算法对大模型进行微调,通过多次迭代来逐渐提升合理特情的占比。
    结果 实验结果表明,经过SGCAD多次迭代后,所生成特情诱因中合理特情的占比高达94%,有效缓解了大模型在舰载机起降特情诱因生成场景中的幻觉问题,并显著提升了特情诱因生成的合理性与真实性。此外,经过专家评判,生成的特情诱因涵盖了多种复杂场景,术语规范且符合物理规律。
    结论 研究成果可为舰载机起降特情分析提供参考。

     

    Abstract:
    Objectives The causal factors underlying anomalies during carrier-based aircraft launch and recovery are often rare and unpredictable. To address the hallucination issues commonly encountered by large language models (LLMs) when generating these anomaly causes, we proposes a scenario generation for carrier aircraft driven by large–small model collaboration (SGCAD) method.
    Methods First, a knowledge base for carrier-based aircraft launch and recovery is constructed by integrating professional literature and retrieval-augmented generation (RAG) techniques, creating a dataset of normal operation descriptions. These normal descriptions serve as templates, which, combined with scenario-specific prompts, guide the large language model to generate potential anomaly causes. A smaller model is then employed to discriminate between reasonable and unreasonable anomaly causes. Finally, the large model undergoes fine-tuning using direct preference optimization (DPO) and is iteratively refined to progressively increase the proportion of reasonable anomaly scenarios.
    Results Experimental results show that, after multiple iterations of the SGCAD method, the proportion of reasonable anomaly causes in the generated dataset reaches 94%, effectively mitigating hallucination issues and significantly improving the rationality and realism of the generated content. Expert evaluations further confirm that the generated anomaly causes encompass diverse and complex scenarios, utilize standardized terminology, and adhere to physical laws.
    Conclusions The proposed approach provides a valuable reference for analyzing anomalies during carrier-based aircraft launch and recovery operations.

     

/

返回文章
返回