欢迎访问《图学学报》 分享到:

图学学报 ›› 2026, Vol. 47 ›› Issue (1): 173-178.DOI: 10.11996/JG.j.2095-302X.2026010173

• 数字化设计与制造 • 上一篇    下一篇

基于大模型的生成式数字孪生体建模

梁生龙1(), 范秋霞2   

  1. 1 珠海城市职业技术学院机电工程学院广东 珠海 519090
    2 山西大学自动化与软件学院山西 太原 030006
  • 收稿日期:2025-04-13 接受日期:2025-08-08 出版日期:2026-02-28 发布日期:2026-03-16
  • 通讯作者:梁生龙,E-mail:a15919281689@126.com
  • 基金资助:
    广东省普通高校特色创新类项目(2023KTSCX327);珠海城市职业技术学院科研项目(KY2023Y03Z);山西省科技合作交流专项-国家重点科技合作项目(202304041101007)

Generative digital twin modeling based on large models

LIANG Shenglong1(), FAN Qiuxia2   

  1. 1 College of Mechanical and Electrical Engineering, Zhuhai City Vocational and Technical College, Zhuhai Guangdong 519090, China
    2 School of Automation and Software, Shanxi University, Taiyuan Shanxi 030006, China
  • Received:2025-04-13 Accepted:2025-08-08 Published:2026-02-28 Online:2026-03-16
  • Supported by:
    Characteristic Innovation Project of Ordinary Universities in Guangdong Province(2023KTSCX327);Research Project of Zhuhai City Vocational and Technical College(KY2023Y03Z);Shanxi Province Science and Technology Cooperation and Exchange Special Project - Key National Science and Technology Cooperation Project(202304041101007)

摘要:

针对数字孪生(DT)技术与大模型在工业设计中融合应用的挑战,提出了一种基于生成式大模型的CAD-LDT数字孪生体建模框架。通过构建物理实体模块、智能生成模块和虚拟实体模块的三元架构,创新性地整合多模态数据融合机制与领域知识约束,实现从物理实体描述到参数化CAD模型的自主生成。采用LLaVA-7B和LLaMA-7B作为基础模型,通过LoRA轻量化适配器实现视觉-文本特征的跨模态对齐,并设计约束编码器将几何公差与物理规则转化为结构化JSON对象。为增强空间变换的数学一致性,引入李群算法优化刚体变换表征;采用几何权重分箱方法离散化复杂装配关系;提出时空解耦生成策略,协同规划空间布局与装配时序。在DeepCAD数据集上的实验结果表明,该框架在几何达到83.6%,约束满足率达91.3%,生成效率提升了26.5%,显著优于主流基线模型。消融实验进一步验证了多模态融合、约束编码机制和李群算法对建模质量的关键贡献。为智能制造领域提供了新的DT建模范式,在参数化设计、装配工艺优化等方面展现出工程应用价值。

关键词: 大模型, 数字孪生, 多模态数据, 智能制造, 参数化设计

Abstract:

To address the challenges in integrating Digital-Twin (DT) technology with large-scale generative models in industrial design, a CAD-LDT digital-twin modeling framework based on generative foundation models was proposed. The framework adopted a triadic architecture consisting of a physical-entity module, an intelligent generation module, and a virtual-entity module, and innovatively incorporated multi-modal data fusion mechanisms and domain-knowledge constraints to enable autonomous generation of parameterized CAD models from physical-entity descriptions. Utilizing LLaVA-7B and LLaMA-7B as backbone models, the framework employed LoRA-based lightweight adapters to achieve cross-modal alignment between visual and textual features, and introduced a constraint encoder that transformed geometric tolerances and physical rules into structured JSON objects. To enhance the mathematical consistency of spatial transformations, Lie-group algorithms were adopted for the optimization of rigid-body transformations, while a geometric-weight binning strategy was proposed to discretize complex assembly relationships. Moreover, a spatiotemporal-decoupled generation strategy was designed to jointly optimize spatial layout and assembly sequencing. Experimental results on the DeepCAD dataset indicated that the proposed framework achieved an Intersection- over-Union (IoU) of 83.6%, a constraint satisfaction rate of 91.3%, and a 26.5% improvement in generation efficiency, significantly outperforming existing baseline models. Further ablation studies confirmed the critical contributions of multi-modal fusion, constraint encoding mechanisms, and Lie-group optimization to modeling performance, providing a novel DT modeling paradigm for intelligent manufacturing with demonstrated value in parametric design and assembly process optimization.

Key words: large models, digital twin, multimodal data, intelligent manufacturing, parametric design

中图分类号: