Welcome to Journal of Graphics share: 

Journal of Graphics ›› 2024, Vol. 45 ›› Issue (3): 594-600.DOI: 10.11996/JG.j.2095-302X.2024030594

• BIM/CIM • Previous Articles     Next Articles

A new interaction paradigm for building design driven by large language model: proof of concept with Rhino7

JIANG Can1,2(), ZHENG Zhe2, LIANG Xiong1, LIN Jiarui2,3(), MA Zhiliang2, LU Xinzheng2   

  1. 1. Glodon Company Limited, Beijing 100193, China
    2. Department of Civil Engineering, Tsinghua University, Beijing 100084, China
    3. Key Laboratory of Digital Construction and Digital Twin, Ministry of Housing and Urban-Rural Development, Beijing 100084, China
  • Received:2023-09-25 Accepted:2023-12-21 Online:2024-06-30 Published:2024-06-12
  • Contact: LIN Jiarui (1987-), associate professor, Ph.D. His research interests are intelligent construction, digital twin and knowledge graph, etc. E-mail:lin611@tsinghua.edu.cn
  • About author:

    JIANG Can (1993-), postdoctoral, Ph.D. His main research interest covers application of artificial intelligence in intelligent construction. E-mail:jiangc-l@glodon.com

  • Supported by:
    National Natural Science Foundation of China(52378306);Research Project of Beijing Municipal Science & Technology Commission, Administrative Commission of Zhongguancun Science Park(20220468132)

Abstract:

As society places higher demands on the quality of building designs, design software has become more professional and complicated. Current design software not only incurs high learning costs but also features complex interaction modes. The recent breakthroughs in large language models (LLM) have enabled computers to clearly comprehend instructions based on human natural language and accurately generate code, which is expected to provide new ideas for the paradigm of human interaction with software. Therefore, this study designed a new paradigm of interactive building design driven by LLM, i.e., shifting from the designers interacting with the design software through multiple keyboard and mouse operations to LLMs writing scripts to invoke APIs according to architects’ instructions. The methodology was proposed and its implementation feasibility in building design was validated. The methodology included: ① LLM retrieved task-related APIs from the API set according to user instructions; ② LLM wrote a program script based on instructions and the abstract of candidate APIs and ran it; ③ LLM revised the script written based on the feedback from the environment, users, etc. To validate the capabilities of current LLMs in executing the key steps of the methodology, multiple design tasks were completed with Rhino7 design software, GPT-4, and CodeLlaMa. The results not only demonstrated that the LLM-driven interactive design paradigm held initial prospects for implementation in building design, but also provided experiences and suggestions for its implementation. The implementation of this design paradigm could reduce the threshold and learning costs, improving the efficiency in many scenarios, and was expected to play a key role in future building design software.

Key words: building design software, interaction with software, large language model, application programming interface, GPT-4, Rhino7, Ladybug

CLC Number: