Welcome to Journal of Graphics share: 

Journal of Graphics ›› 2024, Vol. 45 ›› Issue (6): 1178-1187.DOI: 10.11996/JG.j.2095-302X.2024061178

• Special Topic on “Large Models and Graphics Technology and Applications” • Previous Articles     Next Articles

Large language model powered UI evaluation system

CHEN Xiaojiao1,2(), SHU Yunfeng1,2, WANG Ruihan1,2, ZHOU Jiahuan1,2, CHEN Wei1,2   

  1. 1. State Key Laboratory of CAD&CG, Zhejiang University, Hangzhou Zhejiang 310058, China
    2. Laboratory of Art and Archaeology Image (Zhejiang University), Ministry of Education, Hangzhou Zhejiang 310058, China
  • Received:2024-08-02 Accepted:2024-10-06 Online:2024-12-31 Published:2024-12-24
  • About author:First author contact:

    CHEN Xiaojiao (1988-), professor, Ph.D. Her main research interests cover human-computer interaction interface, user experience design practice, digital humanities, etc. E-mail:chenxiaojiao@zju.edu.cn

  • Supported by:
    National Natural Science Foundation of China(52205290);Aeronautical Science Foundation(2022Z005076001)

Abstract:

The quality of user interface (UI) design directly impacts product usability and user experience. Designers often face challenges related to consistency and accessibility during the UI design process, increasing cognitive load for users reducing efficiency. Despite awareness of these issues, they currently lack comprehensive knowledge and tools or automatic identification and resolution. To address this challenge, a comprehensive set of UI design evaluation criteria was proposed, covering five key aspects: color, text, layout, control, and icon, specifically targeting consistency and accessibility issues in UI design. Based on these evaluation criteria, a prompt template for evaluating UI consistency and accessibility was developed to enhance the accuracy of large language models (LLMs) like GPT-4 in UI evaluation tasks. Furthermore, a UI evaluation system based on the GPT-4 model was developed. This [26] deeply understood UI design content, automatically detected UI design issues according to the evaluation criteria, and provided targeted improvement suggestions to help designers optimize their UI designs. Experimental results demonstrated that using the prompt template significantly improved the accuracy of GPT-4 in UI evaluations. User studies indicated that employing this UI evaluation system in design practice can significantly enhance the quality of UI designs, thereby boosting product usability and user experience. This system provided designers with an automated UI evaluation tool, offering a new approach to enhancing UI design quality.

Key words: graphical user interface, large language model, UI evaluation, consistency, accessibility

CLC Number: