欢迎访问《图学学报》

图学学报 ›› 2026, Vol. 47 ›› Issue (2): 390-401.DOI: 10.11996/JG.j.2095-302X.2026020390

• 计算机图形学与虚拟现实 • 上一篇    下一篇

基于潜在扩散模型的CAD条件生成

刘景豪, 游振国, 杜冬()   

  1. 南京理工大学数学与统计学院江苏 南京 210094
  • 收稿日期:2025-09-02 接受日期:2025-12-12 出版日期:2026-04-30 发布日期:2026-05-20
  • 通讯作者:杜冬,E-mail:dongdu@njust.edu.cn
  • 基金资助:
    国家自然科学基金(62502209);中央高校基本科研业务费专项资金(30925010538)

Conditional generation of CAD models based on latent diffusion models

LIU Jinghao, YOU Zhenguo, DU Dong()   

  1. School of Mathematics and Statistics, Nanjing University of Science and Technology, Nanjing Jiangsu 210094, China
  • Received:2025-09-02 Accepted:2025-12-12 Published:2026-04-30 Online:2026-05-20
  • Contact: DU Dong,E-mail:dongdu@njust.edu.cn
  • Supported by:
    National Natural Science Foundation of China(62502209);Fundamental Research Funds for the Central Universities(30925010538)

摘要:

基于传统计算机辅助设计(CAD)创建兼具可制造性与可编辑性的三维模型是一项复杂且耗时的任务。近年来,深度学习技术在CAD模型自动化生成方面展现出巨大潜力并成为研究热点。然而,多数CAD生成模型未能充分利用点云、图像和草图等输入数据中蕴含的几何与语义信息,难以通过灵活的条件输入精准控制生成方向。针对这一问题,通过挖掘潜在空间的表征能力,采用去噪扩散概率模型,以这类条件输入数据作为引导,实现CAD模型定向生成。具体而言,首先构建基于Transformer架构的自编码器,将CAD参数命令序列编码至潜在空间;进而在此空间内搭建去噪扩散概率模型,融合点云、图像或草图条件编码信息,生成CAD特征向量;最后通过解码器还原为三维CAD模型。实验结果表明,所生成的CAD模型结构合理、表面光滑且几何特征清晰,相较于现有方法,在生成形状多样性、分布相似性与保真度之间实现了较好的平衡,且当以点云、图像或草图作为条件输入时,均能有效提升CAD模型的生成质量。相关代码已开源,详情可见 https://github.com/Ziyou-maker/LDM4CAD

关键词: CAD生成, 参数化建模, 扩散模型, 潜在扩散模型, 条件生成

Abstract:

Creating 3D models with both manufacturability and editability based on traditional Computer-Aided Design (CAD) is a complex and time-consuming task. In recent years, deep learning technology has shown great potential in the automated generation of CAD models and has become a research hotspot. However, most CAD generation models fail to fully utilize the geometric and semantic information contained in input data such as point clouds, images, and sketches, making it difficult to accurately control the generation direction through flexible conditional inputs. To address this issue, the directional generation of CAD models can be achieved by exploring the representational capability of the latent space, adopting a denoising diffusion probabilistic model, and using such conditional input data as guidance. Specifically, a Transformer-based autoencoder was first constructed to encode CAD parameter command sequences into a latent space. Subsequently, a denoising diffusion probabilistic model was established within this space to generate CAD feature vectors by integrating conditional encoding information from point clouds, images, or sketches. Finally, the feature vectors were reconstructed into 3D CAD models via a decoder. Experimental results demonstrated that the generated CAD models exhibited reasonable structures, smooth surfaces, and distinct geometric features. Compared with existing methods, a superior balance was achieved among shape diversity, distribution similarity, and fidelity. Furthermore, the generation quality of CAD models was effectively enhanced when point clouds, images, or sketches were utilized as conditional inputs. The relevant code has been open-sourced and is available at https://github.com/Ziyou-maker/LDM4CAD.

Key words: CAD generation, parametric modeling, diffusion model, latent diffusion model, conditional generation

中图分类号: