Welcome to Journal of Graphics share: 

Journal of Graphics ›› 2025, Vol. 46 ›› Issue (6): 1337-1345.DOI: 10.11996/JG.j.2095-302X.2025061337

• Computer Graphics and Virtual Reality • Previous Articles     Next Articles

Geometry hypergraph aware 3D scene graph generation

LIU Yuanyuan1,2(), FANG Youjiang1,2, MENG Tianyu1,2, MENG Zhengyu1,2, LUO Pengwei1,2, YANG Peigen1,2, JIANG Yutong3, WEI Xiaopeng1,2, ZHANG Qiang1,2, YANG Xin1,2()   

  1. 1 School of Computer Science and Technology, Dalian University of Technology, Dalian Liaoning 116024, China
    2 Key Laboratory of Social Computing and Cognitive Intelligence of Ministry of Education, Dalian Liaoning 116024, China
    3 Chinese Scholartree Ridge State Key Laboratory, China North Vehicle Research Institute, Beijing 100072, China
  • Received:2024-10-09 Accepted:2025-04-15 Online:2025-12-30 Published:2025-12-27
  • Contact: YANG Xin
  • About author:First author contact:

    LIU Yuanyuan (1999-), PhD candidate. Her main research interests cover computer graphics and scene semantic understanding. E-mail:Lyy990415@gmail.com

  • Supported by:
    Science and Technology Innovation 2030 - “New Generation Artificial Intelligence” Major Project(2021ZD12400)

Abstract:

In the field of computer graphics and vision, 3D scene graph generation (SGG) has gained widespread attention in recent years. While existing research has improved the accuracy of coarse-grained classification and single-relation labels, performance in fine-grained classification and multi-label scenarios remains inadequate, limiting real-world applications. To address this, an innovative framework was proposed to fully utilizes contextual information to achieve fine-grained entity classification, multi-relation labeling, and enhanced accuracy. Our method comprised two core modules: the graph feature extraction (GFE) module and the graph context inference (GCI) module. The GFE module was used to extract entity and interaction semantic features from input data to ensure the extraction of key information. The GCI module introduced structural features from both traditional graphs and hypergraphs, analyzed relationships between entities, identified relational proximity within neighborhoods, and merged entities with similar interaction patterns to learn their interactions. The geometric hypergraph structure was dynamically generated based on scene layouts, providing structured organizational information. Experimental evaluations on the 3DSSG dataset, by integrating the organizational capabilities of both traditional graphs and hypergraphs for node and relationship clustering, the proposed work effectively improved fine-grained classification and multi-relation label recognition in 3D SGG tasks.

Key words: geometric hypergraph, 3D scene graph generation, structured organization, node clustering, multi-relational extraction, adaptive updating

CLC Number: