基于深度学习模型的电磁超表面设计关键技术

Key technologies in electromagnetic metasurface design with deep learning models

  • 摘要:
    目的 针对基于深度学习的超表面优化高度依赖大规模全波仿真数据导致优化效率受限的问题,需要深入开展超表面设计场景中的数据集构建方法、深度学习模型结构构建方法、深度学习模型训练方法以及超表面优化方法研究。
    方法 首先,建立以样本重要性为依据的训练样本集采样方法,在减少训练所需样本数量的同时降低模型估算超表面电磁响应的误差;其次,构建一种多模态深度学习模型用于同时提取超表面向量化结构参数特征以及图案特征,提高模型估算响应的性能;再次,提出一种利用深度学习域外泛化特性的模型训练方法,充分利用模型自身泛化性能生成低保真度样本,从而减少高保真度训练样本集的规模;最后,不使用高代价高准确性的精确模型,而是提出一种使用有限样本规模训练得到的粗略深度学习模型进行超表面设计的优化方法。
    结果 数值结果表明,在维持深度学习模型性能的前提下,所提出的数据集构建方法、多模态模型构建方法和训练方法能够将训练所需的全波仿真样本减少30%~50%,大幅减少优化超表面所需的全波仿真。此外,所提出的基于粗略深度学习模型的优化方法能够以数十次迭代和全波仿真设计得到性能优越的超表面。
    结论 所构建的贯穿“数据−模型−训练−应用”闭环的低数据依赖型系统框架,通过4个层面的针对性重构,系统性地缓解了制约超表面设计效率的数据规模难题,为深度学习在电磁工程领域的低成本应用提供了通用的方法论支撑与范式。

     

    Abstract:
    Objective To address the problem that deep-learning-based optimization of metasurfaces relies heavily on large-scale full-wave simulation data, thereby limiting optimization efficiency, the dataset construction methods, deep learning model structure construction methods, deep learning model training methods, and optimization methods for metasurface structure design scenarios are investigated.
    Method First, a training-set sampling method based on sample importance is developed. By evaluating the gradient information of the loss function relative to the samples, this method strategically identifies and selects highly informative data points, significantly reducing the required sample volume while improving the model's accuracy in estimating the electromagnetic responses of metasurface structures. Second, a multimodal deep learning model is constructed to simultaneously extract and integrate features from both vectorized structural parameters and pixelated patterns. Through a systematic feature fusion mechanism, the structural representation capability is enhanced, further improving the response-estimation performance. Third, a novel training method leveraging the out-of-distribution (OOD) generalization property of the deep learning model is proposed. This strategy utilizes the model's intrinsic generalization capabilities to synthesize and introduce low-fidelity response samples outside the initial training distribution, dynamically expanding the feature space and thereby reducing the necessary scale of the high-fidelity training sample set. Finally, instead of relying on a globally accurate model with high computational training costs, an efficient optimization method for metasurface structures is proposed. This approach utilizes a coarse deep learning model trained with a strictly limited sample size, operating in conjunction with an iterative refinement mechanism to guide the optimization process.
    Results Numerical results demonstrate the high efficiency of this data-driven framework. Specifically, while the baseline performance of the deep learning models is rigorously maintained, the respective implementations of the proposed dataset construction method, multimodal model architecture, and OOD training strategy each reduce the number of full-wave simulation samples required for initial training by 30%–50%. This substantial reduction directly alleviates the computational burden associated with generating high-fidelity datasets. Furthermore, during the practical optimization phase, it is demonstrated that the proposed algorithm based on the coarse deep learning model achieves rapid convergence. Metasurface structures exhibiting excellent electromagnetic performance can be successfully designed and synthesized, requiring only dozens of additional iterations and full-wave simulation validations, proving the method's capability to bypass the reliance on highly precise, computationally expensive surrogate models.
    Conclusion A low-data-dependency system framework is detailed that encompasses the process of “data-model-training-application”. Through targeted restructuring at four different levels, it systematically addresses the challenge of data scale limitations that hinder the efficiency of metasurface design. As a result, it provides general methodological support and a paradigm for the low-cost application of deep learning in the field of electromagnetic engineering.

     

/

返回文章
返回