欢迎访问《图学学报》 分享到:

图学学报 ›› 2026, Vol. 47 ›› Issue (1): 68-77.DOI: 10.11996/JG.j.2095-302X.2026010068

• 图像处理与计算机视觉 • 上一篇    下一篇

无人机视角下施工场景目标检测性能评估

宋琢1, 卢德辉1, 黄志超1, 田时雨1, 颜嵘龙2, 邓逸川2,3()   

  1. 1 广州一建建设集团有限公司广东 广州 510060
    2 华南理工大学土木与交通学院广东 广州 510641
    3 亚热带城市与建筑科学全国重点实验室广东 广州 510641
  • 收稿日期:2025-03-19 接受日期:2025-07-23 出版日期:2026-02-28 发布日期:2026-03-16
  • 通讯作者:邓逸川,E-mail:ctycdeng@scut.edu.cn
  • 基金资助:
    国家自然科学基金(52308314);广东省自然科学基金-青年提升项目(2023A1515030169);广东省住房和城乡建设厅科技创新计划项目(20250305J0004);广州市建筑集团有限公司科技计划项目([2023]-KJ008)

Performance evaluation of construction site object detection under drone-captured perspective

SONG Zhuo1, LU Dehui1, HUANG Zhichao1, TIAN Shiyu1, YAN Ronglong2, DENG Yichuan2,3()   

  1. 1 Guangzhou No. 1 Construction Group Co. Ltd., Guangzhou Guangdong 510060, China
    2 School of Civil Engineering and Transportation, South China University of Technology, Guangzhou Guangdong 510641, China
    3 State Key Laboratory of Subtropical Building and Urban Science, Guangzhou Guangdong 510641, China
  • Received:2025-03-19 Accepted:2025-07-23 Published:2026-02-28 Online:2026-03-16
  • Supported by:
    National Natural Science Foundation of China(52308314);Youth Enhance Project of Natural Science Foundation of Guangdong Province(2023A1515030169);Technology Innovation Program of Guangdong Provincial Department of Housing and Urban-Rural Development(20250305J0004);Technology Program Project of Guangzhou Municipal Construction Group CO. LTD([2023]-KJ008)

摘要:

施工现场的组织管理是工程管理的关键环节,但传统的人力监管方法限制多、效率低。近年国家多部委发布有关政策,呼吁促进人工智能与实体经济深度融合,以人工智能推动经济高质高效发展。计算机视觉(CV)技术的准确性、高效性和自动化等优点使CV技术在施工监理领域的应用逐渐广泛,特别是无人机能高效获取复杂多变的施工场景视觉数据的特性显示出其在基于CV技术的施工监管任务中的应用潜力。但当前基于无人机的施工场景目标检测研究有限,且稀缺的无人机视角下的施工场景图像数据集限制着有关研究的深入发展。因此,采用大疆Mavic 3T无人机用于获取施工现场图像,以建立开源的施工场景俯拍图像数据集UB-CSD。选用多种先进目标检测算法在UB-CSD数据集上进行对比实验,从模型流程设计、计算原理和任务场景特性等维度分析各算法性能差异原因。各算法的mAP检测结果为YOLOv8和YOLOv10 (96.1%),YOLOv9 (96.0%),YOLO11 (95.7%),DETR (95.3%),Faster-RCNN (76.3%)和RetinaNet (72.1%)。分析结果表明,YOLO系列算法是基于无人机的施工场景目标检测任务算法的最优选。通过构建全新的开源专用数据集和开展对比实验得出的以上数据及结论,将为建筑业安全生产管理与日后相关检测研究提供有效数据与实验案例。

关键词: 施工场景, 无人机, 目标检测, YOLO, Faster-RCNN, DETR, RetinaNet

Abstract:

The organizational management of construction sites is a critical aspect in engineering management; however, traditional human supervision method is constrained by many environment limitations and low efficiency. In recent years, multiple government departments have issued relevant policies advocating deep integration of artificial intelligence with the real economy to promote high-quality and efficient economic development. The accuracy, efficiency, and automation advantages of Computer Vision (CV) technology have gradually led to its widespread application in the field of construction supervision. Meanwhile, the drones, which can efficiently obtain complex and varied visual data of construction scene, demonstrate their application potential in CV-based construction supervision tasks. However, the current researches on drone-based construction scene detection are limited, and the lack of overhead-perspective construction-scene image datasets restricts further development in the field. Therefore, the DJI Mavic 3T drone was utilized to obtain construction-site images to establish an open-source overhead image dataset for construction scene UB-CSD. Several advanced object-detection algorithms were selected for comparative experiments on the UB-CSD dataset, and the reasons for performance differences were analyzed from multiple dimensions such as model workflow design, computation principle, and task characteristics. The mAPs of every algorithm’s detection result were YOLOv8 and YOLOv10 (96.1%), YOLOv9 (96.0%), YOLO11 (95.7%), DETR (95.3%), Faster-RCNN (76.3%) and RetinaNet (72.1%). The analysis results indicated that the YOLO series algorithm constituted the most optical algorithm for drone-based object detection tasks in construction scenes. By establishing a new open-source special dataset and conducting comparative experiments, the conclusion drawn provided effective data and experimental cases to support future safety production management and object-detection algorithm research in the construction industry.

Key words: construction scene, drones, object detection, YOLO, Faster-RCNN, DETR, RetinaNet

中图分类号: