Journal of Graphics ›› 2026, Vol. 47 ›› Issue (2): 390-401.DOI: 10.11996/JG.j.2095-302X.2026020390
• Computer Graphics and Virtual Reality • Previous Articles Next Articles
LIU Jinghao, YOU Zhenguo, DU Dong(
)
Received:2025-09-02
Accepted:2025-12-12
Online:2026-04-30
Published:2026-05-20
Contact:
DU Dong
Supported by:CLC Number:
LIU Jinghao, YOU Zhenguo, DU Dong. Conditional generation of CAD models based on latent diffusion models[J]. Journal of Graphics, 2026, 47(2): 390-401.
Add to citation manager EndNote|Ris|BibTeX
URL: http://www.txxb.com.cn/EN/10.11996/JG.j.2095-302X.2026020390
| 命令类型 | 参数 | 含义 |
|---|---|---|
| 一个回路的开始 | ||
| 线的端点 | ||
| 圆弧端点 | ||
| 扫掠角度 | ||
| 逆时针标志 | ||
| 圆心 | ||
| 半径 | ||
| 草图平面方向 | ||
| 草图平面原点 | ||
| 比例因子 | ||
| 挤出距离 | ||
| 布尔类型 | ||
| 挤出类型 | ||
| 整个序列的结束 |
Table 1 Types of CAD commands and their corresponding parameters
| 命令类型 | 参数 | 含义 |
|---|---|---|
| 一个回路的开始 | ||
| 线的端点 | ||
| 圆弧端点 | ||
| 扫掠角度 | ||
| 逆时针标志 | ||
| 圆心 | ||
| 半径 | ||
| 草图平面方向 | ||
| 草图平面原点 | ||
| 比例因子 | ||
| 挤出距离 | ||
| 布尔类型 | ||
| 挤出类型 | ||
| 整个序列的结束 |
| 方法 | COV/%↑ | JSD↓ | MMD↓ |
|---|---|---|---|
| DeepCAD | 78.6 | 4.086 | 1.509 |
| SkexGen | 76.8 | 2.110 | 1.395 |
| BrepGen | 75.1 | 1.457 | 1.245 |
| FlexCAD | 76.5 | 2.625 | 1.532 |
| 本文 | 79.1 | 3.051 | 1.348 |
Table 2 Shape generation performance of CAD models
| 方法 | COV/%↑ | JSD↓ | MMD↓ |
|---|---|---|---|
| DeepCAD | 78.6 | 4.086 | 1.509 |
| SkexGen | 76.8 | 2.110 | 1.395 |
| BrepGen | 75.1 | 1.457 | 1.245 |
| FlexCAD | 76.5 | 2.625 | 1.532 |
| 本文 | 79.1 | 3.051 | 1.348 |
| 方法 | 推理时间/ms |
|---|---|
| DeepCAD | 2.88 |
| SkexGen | 49.55 |
| BrepGen | 7164.12 |
| FlexCAD | 9485.88 |
| 本文方法 | 6.43 |
Table 3 Comparison of inference speed for unconditional generation
| 方法 | 推理时间/ms |
|---|---|
| DeepCAD | 2.88 |
| SkexGen | 49.55 |
| BrepGen | 7164.12 |
| FlexCAD | 9485.88 |
| 本文方法 | 6.43 |
| 方法 | Point Cloud | Image | Sketch | |||
|---|---|---|---|---|---|---|
| ACC_cmd | ACC_param | ACC_cmd | ACC_param | ACC_cmd | ACC_param | |
| DeepCAD | 74.91 | 61.22 | 63.15 | 52.04 | 61.96 | 47.36 |
| 本文 | 86.54 | 71.80 | 76.98 | 65.86 | 68.19 | 56.02 |
Table 4 Comparison of CAD model reconstruction metrics under different conditions
| 方法 | Point Cloud | Image | Sketch | |||
|---|---|---|---|---|---|---|
| ACC_cmd | ACC_param | ACC_cmd | ACC_param | ACC_cmd | ACC_param | |
| DeepCAD | 74.91 | 61.22 | 63.15 | 52.04 | 61.96 | 47.36 |
| 本文 | 86.54 | 71.80 | 76.98 | 65.86 | 68.19 | 56.02 |
Fig. 7 CAD generation results based on different conditional data ((a) Generation results based on point clouds; (b) Generation results based on images; (c) Generation results based on sketches)
| [1] | 刘爱军, 黄松波, 闫光荣. 三维CAD混合建模技术研究[J]. 图学学报, 2013, 34(6): 61-63. |
| LIU A J, HUANG S B, YAN G R. Research on hybrid modeling technology for CAD model[J]. Journal of Graphics, 2013, 34(6): 61-63 (in Chinese). | |
| [2] | 黄学良, 李娜, 陈立平. 三维装配几何约束组合的分类求解策略[J]. 图学学报, 2014, 35(2): 236-242. |
| HUANG X L, LI N, CHEN L P. Classification and solution of 3D assembly geometric constraint system between two rigid bodies[J]. Journal of Graphics, 2014, 35(2): 236-242 (in Chinese). | |
| [3] | ROMBACH R, BLATTMANN A, LORENZ D, et al. High-resolution image synthesis with latent diffusion models[C]// 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition. New York: IEEE Press, 2022: 10674-10685. |
| [4] | VASWANI A, SHAZEER N, PARMAR N, et al. Attention is all you need[C]// The 31st International Conference on Neural Information Processing Systems. Red Hook: Curran Associates Inc., 2017: 6000-6010. |
| [5] | WU J J, ZHANG C K, XUE T F, et al. Learning a probabilistic latent space of object shapes via 3D generative-adversarial modeling[C]// The 30th International Conference on Neural Information Processing Systems. Red Hook: Curran Associates Inc., 2016: 82-90. |
| [6] |
GOODFELLOW I, POUGET-ABADIE J, MIRZA M, et al. Generative adversarial networks[J]. Communications of the ACM, 2020, 63(11): 139-144.
DOI URL |
| [7] | YANG G D, HUANG X, HAO Z K, et al. PointFlow: 3D point cloud generation with continuous normalizing flows[C]// 2019 IEEE/CVF International Conference on Computer Vision. New York: IEEE Press, 2019: 4540-4549. |
| [8] | WANG N Y, ZHANG Y D, LI Z W, et al. Pixel2Mesh: generating 3D mesh models from single RGB images[C]// The 15th European Conference on Computer Vision. Cham: Springer, 2018: 55-71. |
| [9] | MESCHEDER L, OECHSLE M, NIEMEYER M, et al. Occupancy networks: learning 3D reconstruction in function space[C]// 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition. New York: IEEE Press, 2019: 4455-4465. |
| [10] | PARK J J, FLORENCE P, STRAUB J, et al. DeepSDF: learning continuous signed distance functions for shape representation[C]// 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition. New York: IEEE Press, 2019: 165-174. |
| [11] | MILDENHALL B, SRINIVASAN P P, TANCIK M, et al. NeRF: representing scenes as neural radiance fields for view synthesis[J]. Communications of the ACM, 2022, 65(1): 99-106. |
| [12] | SHARMA G, GOYAL R, LIU D F, et al. CSGNet: neural shape parser for constructive solid geometry[C]// 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. New York: IEEE Press, 2018: 5515-5523. |
| [13] | KANIA K, ZIĘBA M, KAJDANOWICZ T. UCSG-NET- unsupervised discovering of constructive solid geometry tree[C]// The 34th International Conference on Neural Information Processing Systems. Red Hook: Curran Associates Inc., 2020: 736. |
| [14] | WANG X G, XU Y L, XU K, et al. PIE-NET: parametric inference of point cloud edges[C]// The 34th International Conference on Neural Information Processing Systems. Red Hook: Curran Associates Inc., 2020: 1693. |
| [15] | JAYARAMAN P K, LAMBOURNE J G, DESAI N, et al. SolidGen: an autoregressive model for direct B-rep synthesis[EB/OL]. [2025-05-05]. https://arxiv.org/abs/2203.13944. |
| [16] | VINYALS O, FORTUNATO M, JAITLY N. Pointer networks[C]// The 29th International Conference on Neural Information Processing Systems. Red Hook: Curran Associates Inc., 2015: 2692-2700. |
| [17] | XU X, LAMBOURNE J, JAYARAMAN P, et al. BrepGen: a B-rep generative diffusion model with structured latent geometry[J]. ACM Transactions on Graphics, 2024, 43(4): 119. |
| [18] | LI J, FU Y H, CHEN F L. DTGBrepGen: a novel B-rep generative model through decoupling topology and geometry[C]// 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition. New York: IEEE Press, 2025: 21438-21447. |
| [19] | WU R D, XIAO C, ZHENG C X. DeepCAD: a deep generative network for computer-aided design models[C]// 2021 IEEE/CVF International Conference on Computer Vision. New York: IEEE Press, 2021: 6752-6762. |
| [20] | XU X, WILLIS K D D, LAMBOURNE J G, et al. SkexGen: autoregressive generation of cad construction sequences with disentangled codebooks[EB/OL]. [2025-07-02]. https://proceedings.mlr.press/v162/xu22k.html. |
| [21] | XU X, JAYARAMAN P K, LAMBOURNE J G, et al. Hierarchical neural coding for controllable CAD model generation[EB/OL]. [2025-07-02]. https://proceedings.mlr.press/v202/xu23f.html. |
| [22] | ZHANG Z W, SUN S Z, WANG W X, et al. FlexCAD: unified and versatile controllable CAD generation with fine-tuned large language models[EB/OL]. [2025-12-08]. https://arxiv.org/abs/2411.05823. |
| [23] | LI C J, PAN H, BOUSSEAU A, et al. Sketch2CAD: sequential cad modeling by sketching in context[J]. ACM Transactions on Graphics, 2020, 39(6): 164. |
| [24] | Li C J, PAN H, BOUSSEAU A, et al. Free2CAD: parsing freehand drawings into CAD commands[J]. ACM Transactions on Graphics, 2022, 41(4): 93. |
| [25] | HÄHNLEIN F, LI C J, MITRA N J, et al. CAD2Sketch: generating concept sketches from CAD sequences[J]. ACM Transactions on Graphics, 2022, 41(6): 279. |
| [26] | UY M A, CHANG Y Y, SUNG M, et al. Point2Cyl: reverse engineering 3D objects from point clouds to extrusion cylinders[C]// 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition. New York: IEEE Press, 2022: 11840-11850. |
| [27] | REN D X, ZHENG J M, CAI J F, et al. ExtrudeNet: unsupervised inverse sketch-and-extrude for shape parsing[C]// The 17th European Conference on Computer Vision. Cham: Springer, 2022: 482-498. |
| [28] | LI P, GUO J W, ZHANG X P, et al. SECAD-net: self-supervised CAD reconstruction by learning sketch-extrude operations[C]// 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition. New York: IEEE Press, 2023: 16816-16826. |
| [29] | ZHOU S D, TANG T Y, ZHOU B. CADParser: a learning approach of sequence modeling for B-rep CAD[EB/OL]. [2025-07-02]. https://dblp.org/rec/conf/ijcai/ZhouTZ23.html?view=bibtex. |
| [30] | HO J, JAIN A, ABBEEL P. Denoising diffusion probabilistic models[C]// The 34th International Conference on Neural Information Processing Systems. Red Hook: Curran Associates Inc., 2020: 574. |
| [31] | CHOI J, KIM S, JEONG Y, et al. ILVR: conditioning method for denoising diffusion probabilistic models[C]// 2021 IEEE/CVF International Conference on Computer Vision. New York: IEEE Press, 2021: 14347-14356. |
| [32] | GU S Y, CHEN D, BAO J M, et al. Vector quantized diffusion model for text-to-image synthesis[C]// 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition. New York: IEEE Press, 2022: 10686-10696. |
| [33] | VAN DEN OORD A, VINYALS O, KAVUKCUOGLU K. Neural discrete representation learning[C]// The 31st International Conference on Neural Information Processing Systems. Red Hook: Curran Associates Inc., 2017: 6309-6318. |
| [34] |
ZHANG A J, JIA W Q, ZOU Q, et al. Diffusion-CAD: controllable diffusion model for generating computer-aided design models[J]. IEEE Transactions on Visualization and Computer Graphics, 2025, 31(12): 10188-10199.
DOI URL |
| [35] | ALAM M F, AHMED F. GenCAD: image-conditioned computer-aided design generation with transformer-based contrastive representation and diffusion priors[EB/OL]. [2025-05-05]. https://arxiv.org/abs/2409.16294. |
| [36] |
WANG H X, ZHAO M Y, Wang Y Q, et al. VQ-CAD: computer-aided design model generation with vector quantized diffusion[J]. Computer Aided Geometric Design, 2024, 111: 102327.
DOI URL |
| [37] | QI C R, YI L, SU H, et al. PointNet++: deep hierarchical feature learning on point sets in a metric space[C]// The 31st International Conference on Neural Information Processing Systems. Red Hook: Curran Associates Inc., 2017: 5105-5114. |
| [38] | HE K M, ZHANG X Y, REN S Q, et al. Deep residual learning for image recognition[C]// 2016 IEEE Conference on Computer Vision and Pattern Recognition. New York: IEEE Press, 2016: 770-778. |
| [39] |
CANNY J. A computational approach to edge detection[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1986, PAMI-8(6): 679-698.
DOI URL |
| [40] | WILLIS K D D, PU Y W, LUO J L, et al. Fusion 360 gallery: a dataset and environment for programmatic CAD construction from human design sequences[J]. ACM Transactions on Graphics, 2021, 40(4): 54. |
| [1] | ZHAO Zhenbing, ZHANG Jingliang, TANG Chenkang, BI Yuxuan, LI Haopeng. Precise-oil leakage segmentation for substation equipment under water-accumulation interference [J]. Journal of Graphics, 2026, 47(2): 296-310. |
| [2] | CHEN Mengqi, ZHAO Junli, DENG Xiaodan. SAM-based mask generation and segmentation for dermatological images [J]. Journal of Graphics, 2026, 47(2): 322-331. |
| [3] | ZHOU Jin, ZHOU Yi, XU Pengfei, HUANG Hui. PDF-Sketch: layout-based sketch generation via primitive distance fields and discrete diffusion [J]. Journal of Graphics, 2026, 47(2): 380-389. |
| [4] | DENG Peng, TAN Wenzheng, LUO Huiming, LI Shuai, YANG Bin. Research of parametric modeling methods for isolated foundation based on Revit secondary development [J]. Journal of Graphics, 2026, 47(1): 194-203. |
| [5] | LIU Defeng, CHEN Weizheng, BAI Yaqiang, LIU Kai, WANG Qi. Exploration of forward design methods for ship conceptual schemes based on conditional generative models [J]. Journal of Graphics, 2025, 46(6): 1209-1215. |
| [6] | YE Wenlong, CHEN Bin. PanoLoRA: an efficient finetuning method for panoramic image generation based on Stable Diffusion [J]. Journal of Graphics, 2025, 46(5): 980-989. |
| [7] | LEI Songlin, ZHAO Zhengpeng, YANG Qiuxia, PU Yuanyuan, GU Jinjing, XU Dan. Zero-shot style transfer based on decoupled diffusion models [J]. Journal of Graphics, 2025, 46(4): 727-738. |
| [8] | SUN Heyi, LI Yixiao, TIAN Xi, ZHANG Songhai. Image to 3D vase generation technology combining procedural content generation and diffusion models [J]. Journal of Graphics, 2025, 46(2): 332-344. |
| [9] | LI Jiyuan, GUAN Zheyu, SONG Haichuan, TAN Xin, MA Lizhuang. Human-in-the-loop field-specific logo generation method [J]. Journal of Graphics, 2025, 46(2): 382-392. |
| [10] | TU Qinghao, LI Yuanqi, LIU Yifan, GUO Jie, GUO Yanwen. Generalization optimization method for text to material texture maps based on diffusion model [J]. Journal of Graphics, 2025, 46(1): 139-149. |
| [11] | ZHANG Ji, CUI Wenshuai, ZHANG Ronghua, WANG Wenbin, LI Yaqi. A text-driven 3D scene editing method based on key views [J]. Journal of Graphics, 2024, 45(4): 834-844. |
| [12] | WANG Ji, WANG Sen, JIANG Zhi-wen, XIE Zhi-feng, LI Meng-tian. Zero-shot text-driven avatar generation based on depth-conditioned diffusion model [J]. Journal of Graphics, 2023, 44(6): 1218-1226. |
| [13] | ZOU Qiang. A note on solid modeling: history, state of the art, future [J]. Journal of Graphics, 2022, 43(6): 987-1001. |
| [14] | GAO Dai1, WANG Hongyang2, DU Jiahe2, CAI Ziang2, LOBSANG Tsring2. Research on BIM Parametric Modeling Method of Chinese Classical Architectures [J]. Journal of Graphics, 2018, 39(2): 333-338. |
| [15] | HU Yechang, WEI Zhifang, WANG Zhiwei. Research on Parametric Modeling Method of Stranded Wire Helical Torsion Spring [J]. Journal of Graphics, 2017, 38(6): 820-825. |
| Viewed | ||||||
|
Full text |
|
|||||
|
Abstract |
|
|||||