A model adaptive method for Chinese word segmentation using IFC-based building information model
(Chinese title: 基于 IFC 标准的 BIM 自适应分词方法)

doi:10.11996/JG.j.2095-302X.2021020316

Journal of Graphics ›› 2021, Vol. 42 ›› Issue (2): 316-324. DOI: 10.11996/JG.j.2095-302X.2021020316

1. School of Electrical and Information Engineering, Beijing University of Civil Engineering and Architecture, Beijing 100044, China; 2. Beijing Key Laboratory of Intelligent Processing for Building Big Data, Beijing 102616, China

Online: 2021-04-30 Published: 2021-04-30
Supported by:
National Natural Science Foundation of China (71601013); Beijing Municipal Natural Science Foundation (4202017); Beijing Youth Talent Training Project (CIT&TCD201904050); Young Elite of Beijing University of Civil Engineering and Architecture; Fundamental Research Funds for Beijing University of Civil Engineering and Architecture (X20039)

Abstract

Abstract: The building information model (BIM) has become an effective solution for applying information technology in the construction industry. As the volume of BIM data keeps growing, many studies have introduced natural language processing (NLP) into BIM applications in order to use that data efficiently. In Chinese-language settings, however, Chinese word segmentation, a foundational NLP step, adapts poorly to BIM applications in the building field because it lacks the terminology features of the construction industry. This study analyzes files in the Industry Foundation Classes (IFC) format, a currently popular BIM data format, extracts BIM model features from them, and adds these features, together with building-domain terminology features, to a statistical word segmentation model in order to improve the performance of Chinese word segmentation in the building field. The experimental results show that, compared with the original conditional random field (CRF)-based word segmentation model, the F-measure on the building-domain test set increased by 1.26%; with BIM model features added alone, the F-measure increased by 0.10%. This indicates that adding BIM model features to the segmentation model is effective in improving Chinese word segmentation in the building field. Meanwhile, on the BIM model test set, compared with adding building-domain terminology features alone, adding BIM model features raised precision from 46.97% to 87.74%, recall from 67.60% to 94.77%, and the F-measure from 55.43% to 91.12% (a gain of 35.69 percentage points), effectively improving the BIM model adaptability of Chinese word segmentation in the building field.

Key words: building information model, industry foundation classes, Chinese word segmentation, model adaptation, building information extraction
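The F-measure values quoted in the abstract can be cross-checked from the reported precision and recall. The sketch below assumes the paper uses the balanced F-measure (the harmonic mean of precision and recall); the abstract does not state the formula, but the reported numbers are consistent with it. The function name `f_measure` is illustrative and not taken from the paper.

```python
def f_measure(precision: float, recall: float) -> float:
    """Balanced F-measure: harmonic mean of precision and recall.

    Assumption: the paper's F-measure is the balanced (F1) variant.
    Inputs and output use the same unit (here, percent).
    """
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Precision and recall (in percent) reported in the abstract
# for the BIM model test set.
terms_only = f_measure(46.97, 67.60)   # terminology features only
with_bim = f_measure(87.74, 94.77)     # terminology + BIM model features

print(f"terminology features only: F = {terms_only:.2f}%")
print(f"with BIM model features:   F = {with_bim:.2f}%")
print(f"absolute gain: {with_bim - terms_only:.2f} percentage points")
```

Under this assumption the script reproduces the abstract's figures: 55.43%, 91.12%, and a gain of 35.69 percentage points.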

CLC number: TP 391

ZHANG Xin, ZHOU Xiao-ping, WANG Jia. A model adaptive method for Chinese word segmentation using IFC-based building information model [J]. Journal of Graphics, 2021, 42(2): 316-324.

