Journal of Graphics

Cover of issue 2, 2023

2023, 44(2): 0-0.

Abstract ( 191 )

PDF (1584KB) ( 167 )

Related Articles | Metrics

3D shape completion via deep learning: a method survey

YANG Liu, WU Xiao-qun

2023, 44(2): 201-215. DOI: 10.11996/JG.j.2095-302X.2023020201

Abstract ( 1043 )

HTML ( 55 )

PDF (2654KB) ( 563 )

The task of 3D shape completion, a fundamental aspect of computer graphics and computer vision, has been widely employed in many fields. 3D shape completion aims to infer complete shapes from partially missing shape data. This paper reviewed the current 3D model completion algorithms based on deep learning, and analyzed their advantages and disadvantages. According to the different forms of descriptors, the 3D model completion algorithms could be broadly classified into two categories: the completion methods based on 2D shape descriptors and the completion methods based on 3D shape descriptors. The former involved the projecting of the 3D model into the 2D space for feature extraction to obtain a complete model, including 3D model completion methods based on 2D images and depth maps. The latter involved the direct use of 3D representation for model completion, and according to different representations of 3D models, could be further divided into voxel-based, point cloud-based and implicit-based methods. Meanwhile, this survey provided an overview of the commonly used datasets, metrics, and state-of-the-art performance in the field. It also analyzed and discussed the problems facing current 3D model completion algorithms based on deep learning, and suggested potential avenues for future research.

Figures and Tables | References | Related Articles | Metrics

High resolution reconstruction of temperature field based on cascaded dense residual network

ZHANG Li-feng, LI Jing

2023, 44(2): 216-224. DOI: 10.11996/JG.j.2095-302X.2023020216

Abstract ( 234 )

HTML ( 3 )

PDF (1311KB) ( 116 )

The high-quality measurement of temperature distribution is of great importance for industrial production. As a non-invasive measurement method, acoustic tomography (AT) is considered as a promising technique for the visualization of temperature distribution. To enhance the reconstruction quality, a two-stage high-resolution reconstruction algorithm was proposed for temperature field based on virtual observation (VO) and cascaded dense residual network (CDRNN). Firstly, the temperature field of coarse grid was obtained by the virtual observation algorithm, and then the CDRNN was built to predict the fine grid temperature distribution. The VO algorithm was used to achieve the overall least squares solution of the AT inverse problem, thereby reducing the reconstruction error caused by the bending of the acoustic path. Additionally, a dual-input compensation strategy was introduced to increase the utilization of the original measurement information by the CDRNN, and to improve the network stability. The network structure was streamlined by setting up sub-networks, dense connections and residual connections were also employed to improve network information flow. Sub-pixel convolutional layers were introduced to decrease network computing dimensions and boost reconstruction accuracy. Finally, the effectiveness of the algorithm was verified using a variety of simulated temperature field models. Through the numerical simulation of a typical temperature field model and comparison with the Landweber iterative method, ART algorithm, ART-NN algorithm, and VO algorithm, it was found that the average relative error and root mean square error of the VO-CDRNN algorithm were 0.44% and 0.68%, respectively, thus achieving better reconstruction results than other algorithms.

Figures and Tables | References | Related Articles | Metrics

Dynamic balanced multi-scale feature fusion for colorectal polyp segmentation

LU Qiu, SHAO Hua-ze, ZHANG Yun-lei

2023, 44(2): 225-232. DOI: 10.11996/JG.j.2095-302X.2023020225

Abstract ( 299 )

HTML ( 7 )

PDF (2611KB) ( 190 )

Colorectal cancer is one of the most prevalent diseases, and accurate colorectal polyp segmentation can aid physicians in early prevention. However, in the process of segmentation, colorectal polyp images present several challenges, such as low contrast, varied shapes of lesions, and randomized location. Moreover, with large parameters, the Unet network does not yield high segmentation accuracy. Therefore, an improved Unet algorithm based on dynamic balanced multiscale feature fusion was proposed. This algorithm took Unet as the main body and combined the atrous spatial pyramid pooling module (ASPP). A channel shuffle inception (CSI) module and a group inception (GI) module were also put forth to improve the convolution block of the codec, reduce the amount of network parameters, and improve the model′s characterization ability. Additionally, a residual pyramid split attention module (RPSA) was presented for the skip connection of the codec, balancing the channel information in the skip connection, and improving the overall network split performance. Experimental results showed that this method could not only outperform other methods in terms of segmentation effect, but also significantly reduce the number of parameters, thereby proving its effectiveness.

Figures and Tables | References | Related Articles | Metrics

P-CenterNet for chimney detection in optical remote-sensing images

XIE Guo-bo, HE Di-xuan, HE Yu-qin, LIN Zhi-yi

2023, 44(2): 233-240. DOI: 10.11996/JG.j.2095-302X.2023020233

Abstract ( 300 )

HTML ( 8 )

PDF (3035KB) ( 106 )

Industrial chimney emissions are among the primary drivers of urban air pollution, with the urban environment quality inversely related to the quantity of chimneys therein. Therefore, the detection of chimney placement exerts a positive impact on urban environmental detection and governance. To address the problem of low detection accuracy caused by complex backgrounds and small targets in optical remote sensing images with numerous similar objects, P-CenterNet, a CenterNet-based detector, has been proposed for chimney detection tasks. Firstly, P-CenterNet employed pyramidal convolution in the backbone network to obtain richer linguistic features, instead of normal convolution in the backbone network. Secondly, a multi-scale contextual feature extraction module was designed in parallel with the backbone network to retain low-level feature information that helped distinguish object regions from background regions. Finally, a convolutional block attention module was added to further extract the output features of the backbone network to improve the detector′s representation of small targets. Furthermore, DIOR, a large-scale public dataset, was applied for the validation of the model in the experiments. The dataset was expanded and the robustness of the model was enhanced via both online and offline enhancements. The results indicated that P-CentreNet could significantly improve detection accuracy with a similar detection time cost, compared with other models such as Faster-RCNN and YOLOv3, with mAP reaching 89.77%.

Figures and Tables | References | Related Articles | Metrics

A night traffic scene enhancement algorithm based on double fusion Unet light suppression curve estimation

GAO Tao, WANG Dui-e, CHEN Ting, WANG Xiao

2023, 44(2): 241-248. DOI: 10.11996/JG.j.2095-302X.2023020241

Abstract ( 285 )

HTML ( 11 )

PDF (994KB) ( 141 )

The existing enhancement methods were found to be inadequate for dealing with night traffic images characterized by multiple and complex light sources and uneven brightness distribution, and were prone to overexposure and image blurring. To address this problem, a night traffic image enhancement algorithm for light suppression curve estimation based on double fusion Unet was proposed. First, the glow decomposition model was introduced to suppress the light of the input image, suppressing the noise of the image while removing the influence of artificial light sources. Secondly, the double fusion Unet network was utilized, where the designed double fusion module could integrate more layers in the encoding and decoding process. The feature information preserved more image details when extracting illumination information, thereby predicting the illumination distribution map better suited for the input image. Finally, the suppression image, the original night traffic image, and the illumination parameter map extracted by the network served as input, and the improved curve estimation algorithm was applied, thus enhancing the input night traffic image and improving visual quality of the image. Experimental results showed that the proposed algorithm could outperform its counterparts in both subjective and objective comparisons, proving its effectiveness, particularly in the cases of many light sources with uneven distribution.

Figures and Tables | References | Related Articles | Metrics

Object detection in remote sensing image based on two-stage anchor and class balanced loss

ZENG Lun-jie, CHU Jun, CHEN Zhao-jun

2023, 44(2): 249-259. DOI: 10.11996/JG.j.2095-302X.2023020249

Abstract ( 294 )

HTML ( 9 )

PDF (18384KB) ( 122 )

Due to the large differences in the number of different categories of targets in the existing remote sensing image data collection, the distribution of categories in the dataset is unbalanced, affecting the detection accuracy of network models for a few categories. In light of the aforementioned challenges, a two-stage anchor frame and class-balanced loss target detection algorithm for remote sensing images was presented. The class balance labels of remote sensing datasets were generated by K-means clustering, subsequently utilized as the initial center for the second stage of K-means clustering. The resulting preset anchor frames were able to take into account a few class scales and improve the detection accuracy of a few class instances. At the same time, class equalization loss (CEQL) was constructed. Based on equalization loss (EQL), effective samples were used to construct auxiliary weights to improve the model′s attention to a few categories during training. The experimental results demonstrated that the improved model achieved an average accuracy of 76.13% and a few categories′ average accuracy of 76.51%, increasing by 1.56% and 1.75%, respectively, compared with the datum network. When evaluated on the DOIR and NWPU VHR-10 datasets, and compared with the main methods such as Faster-RCNN, RetinaNet, CenterNet, YOLOv4, YOLOX-L, YOLOv5, and YOLOv7, the experiment showed that the improved algorithm could effectively improve the detection accuracy of a few categories while maintaining the detection accuracy of most categories.

Figures and Tables | References | Related Articles | Metrics

Saliency detection-guided for image data augmentation

ZENG Wu, ZHU Heng-liang, XING Shu-li, LIN Jiang-hong, MAO Guo-jun

2023, 44(2): 260-270. DOI: 10.11996/JG.j.2095-302X.2023020260

Abstract ( 406 )

HTML ( 19 )

PDF (3104KB) ( 145 )

In view of the fact that most data augmentation methods tend to be overly random in their selection of cropped regions, and tend to place too much emphasis on the feature salient regions in the image while neglecting the reinforcement learning of the poorly discriminative regions in the image, the SaliencyOut and SaliencyCutMix methods were proposed to enhance the learning of poorly discriminative regions in images. Specifically, SaliencyOut first employed the saliency detection technology to generate a saliency map of the original image, subsequently identifying a feature salient area in the saliency map and removing the pixels in this area. SaliencyCutMix, on the other hand, removed the cropped area of the original image and replaced it with the same area of the patch image. By occluding or replacing some feature salient areas in the image, the model was guided to learn other features about the target object. In addition, to address the issue of losing too many salient feature regions in the cases of large cropping areas, an adaptive scaling factor was incorporated in the selection of the cropping boundary. This factor enabled the dynamic adjustment of the size of the cropping boundary according to the difference in the initial size of the cropping area boundary. Experimental results on four datasets showed that the proposed method could significantly improve the classification performance and anti-interference ability of the model, surpassing most advanced methods. In particular, in the Mini-ImageNet dataset, when applied to the ResNet-34 network, SaliencyCutMix could improve the Top-1 accuracy by 1.18% compared to CutMix.

Figures and Tables | References | Related Articles | Metrics

Flowers recognition based on lightweight visual transformer

XIONG Ju-ju, XU Yang, FAN Run-ze, SUN Shao-cong

2023, 44(2): 271-279. DOI: 10.11996/JG.j.2095-302X.2023020271

Abstract ( 707 )

HTML ( 30 )

PDF (9669KB) ( 289 )

Due to the similarity between different kinds of flowers and the dissimilarity within the same kind of flowers, the results of convolutional neural network (CNN) that extracts local feature information in flower image recognition are not ideal. Based on the Swin Transformer (Swin-T) network, this paper proposed a lightweight Transformer network LWFormer. Firstly, the network introduced the mobile window-based PoolFormer module into the first and second stages of the Swin-T network to lightweight the network. Secondly, a dual-channel attention mechanism was introduced, in which two independent channels focused on the “location” and “content” of the feature map, respectively, to improve the network′s ability to extract global feature information. Finally, a contrastive loss function was employed to further optimize the performance of the network. The enhanced model was evaluated on two public datasets, Oxford 102 Flower Dataset and 104 Flowers Garden of Eden, and compared with other methods. On these two datasets, the accuracy rates were 88.1% and 87.3%, respectively. Compared with the Swin-T network, the network parameters were reduced by 33.45%, FLOPs was reduced by 28.89%, throughput was increased by 91.45%, and accuracy was increased by 1.8%. Experimental results showed that the proposed network could improve the accuracy while reducing the number of parameters, thus enhancing the speed and accuracy.

Figures and Tables | References | Related Articles | Metrics

Unsupervised person re-identification with multi-branch attention network and similarity learning strategy

FENG Zun-deng, WANG Hong-yuan, LIN Long, SUN Bo-yan, CHEN Hai-qin

2023, 44(2): 280-290. DOI: 10.11996/JG.j.2095-302X.2023020280

Abstract ( 253 )

HTML ( 4 )

PDF (938KB) ( 89 )

The challenge facing the unsupervised person re-identification (Re-ID) lies in learning discriminative features without true labels. To address this, a person re-identification feature extraction method based on multi-branch attention network was proposed, in order to enhance the ability of the network to express pedestrian features and capture more abundant feature information from spatial and channel dimensions. This method could learn a more discriminative representation of pedestrian features by capturing the interaction information between different branches on the spatial dimension and the channel dimension. In addition, to tackle the issue of noisy labels interfering with cluster centroids, a similarity learning strategy (SLS) was proposed. This strategy first calculated the similarity between the sample features in each cluster, and then selected the samples corresponding to the feature vector with the highest similarity score for contrastive learning, thereby effectively mitigating the cumulative training error caused by noisy labels. The experimental results revealed that compared with the self-paced contrastive learning (SPCL) method in the unsupervised scenarios, the rank-1 precision on the three datasets Market1501, DukeMTMC-reID, and MSMT17 was increased by 4.6%, 3.3%, and 16.3%, respectively, significantly enhancing the retrieval accuracy of unsupervised person re-identification.

Figures and Tables | References | Related Articles | Metrics

Research on safety clothing detection method for surveillance video of thermal power plant

CHEN Gang, ZHANG Pei-ji, GONG Dong-dong, YU Jun-qing

2023, 44(2): 291-297. DOI: 10.11996/JG.j.2095-302X.2023020291

Abstract ( 255 )

HTML ( 1 )

PDF (2491KB) ( 87 )

In the construction of a smart power plant, computer vision technology is usually utilized to detect the surveillance video returned by surveillance cameras deployed in industrial plants to monitor whether workers are wearing safety clothing. However, due to the complexity of the scenes in the thermal power plant, the existing datasets and algorithms proved inadequate in achieving the desired level of accuracy. Based on the industrial surveillance video, a dataset for safety clothing target detection in the thermal power plant scene was constructed. To address the problem of YOLOv5 being insufficient in the detection accuracy for safety clothing targets, various algorithm models such as EfficientNet, ResNet-50, ShuffleNet, and MobileNet were implemented as alternatives to the original YOLOv5 Backbone module network structure. Additionally, model fusion algorithms based on YOLOv5 were proposed. Based on the industrial scene safety clothing target detection dataset, the optimal algorithms in related fields were selected and compared with the optimized YOLOv5 algorithm. The experimental results demonstrated that the accuracy of the improved YOLOv5+EfficientNet algorithm in the industrial scene security service dataset has been significantly improved, with the highest detection accuracy of 96.6%.

Figures and Tables | References | Related Articles | Metrics

Quaternion patch-group sparse coding for color image denoising

SHI Miao-wen, FAN Lin-wei, WANG Hua, ZHANG Cai-ming

2023, 44(2): 298-303. DOI: 10.11996/JG.j.2095-302X.2023020298

Abstract ( 270 )

HTML ( 5 )

PDF (2222KB) ( 115 )

Images are inevitably corrupted by noise during transition and acquisition, which exerts a considerable influence on the subsequent processing. Therefore, image denoising is essential for image processing. Specially, the critical challenge of image denoising is to remove the noise while preserving information as much as possible. Generally, the group-based sparse representation model is exploited to restore the clean image, due to the self-similarity of natural images. This paper offered a novel color image denoising method that utilized quaternions in the group sparsity model, where each pixel was expressed as a pure quaternion. Initially, each pixel of the observed image was expressed as a quaternion unit, and a quaternion patch group matrix was established by Pearson′s correlation coefficient. The proposed model then learnt the dictionary for each patch group, working well with the pursuit algorithm. In other words, the group-based sparsity method assumed that each patch group was a linear combination of the basic elements of the dictionary. Unfortunately, it remained arduous to reconstruct the image structure precisely. Therefore, the group sparsity model incorporated kernel Wiener filtering to enhance image structure quality. In contrast to the traditional models, the new model not only worked with the corresponding RGB channels, but also leveraged the relationship between patches. Fueled by the exploration of the inner correlation of color channels, the proposed method could preserve the image information as much as possible while removing noise. The experiments validated the efficiency of the proposed method both in numerical results and visual performance on different noise levels.

Figures and Tables | References | Related Articles | Metrics

Prediction of damaged areas in Yunnan murals using Dual Dense U-Net

LUO Qi-ming, WU Hao, XIA Xin, YUAN Guo-wu

2023, 44(2): 304-312. DOI: 10.11996/JG.j.2095-302X.2023020304

Abstract ( 286 )

HTML ( 8 )

PDF (11291KB) ( 95 )

The prediction of damaged areas of murals constitutes an important part of the virtual restoration of murals. However, current methods are prone to problems such as incomplete prediction of damaged areas and inaccurate prediction of damaged boundaries of the complex texture area in the case of Yunnan Minority Murals. To address these challenges, an improved Dual Dense U-Net segmentation model based on U-Net was proposed. This method enhanced the location and texture features of damaged regions, resulting in more discriminative information and the improved accuracy of damaged mask prediction. To enable the model to learn mural features more effectively, a segmentation dataset containing 5,000 images of Yunnan Minority Murals was established. The Dual Dense U-Net model employed a fusion module to perform a multi-scale fusion of mural images, mitigating the loss of local texture information and spatial position information in the feedforward process of mural images. First, the U-Net structure was used to extract information from the input mural image. The fusion module was comprised of multiple depthwise separable convolutions, which could improve the efficiency and segmentation accuracy of the fusion module. Secondly, the fusion module connected two U-Nets to further strengthen the connection between shallow features and deep features. The experimental results revealed that the IoU and Dice evaluation indicators of the model were improved by 3 percentage points compared with UNet++, and that the damaged areas predicted by the model could significantly improve the restoration effect of the murals restoration network. The proposed model was thus proven to be effective in predicting damaged areas of murals.

Figures and Tables | References | Related Articles | Metrics

Video atrial fibrillation detection using self-attentional anti-interference network

CHEN Jing, YANG Xue-zhi, CHEN Jing, LIU Xue-nan

2023, 44(2): 313-323. DOI: 10.11996/JG.j.2095-302X.2023020313

Abstract ( 199 )

HTML ( 2 )

PDF (1784KB) ( 49 )

Early detection and diagnosis of atrial fibrillation is the key to reducing the risk of atrial fibrillation and complications. Although the video photoplethysmography (VPPG) technology provides a new approach to atrial fibrillation screening, it is susceptible to motion interference in real-world scenarios. When the existing VPPG atrial fibrillation detection method encounters motion interference, the pulse signal will be distorted, resulting in misjudgment. To solve the above problems, an anti-interference video atrial fibrillation detection model was proposed. The model employed an attention encoder network to extract robust spiking latent features from spiking signals containing motion disturbances. A radial basis classification network then performed atrial fibrillation detection based on these latent features. The attention encoder mapped complex impulse signals into high-dimensional subspaces, focusing on effective information and extracting robust latent features. Furthermore, Radial Basis Classification Network enhanced atrial fibrillation recognition ability under the supervision of atrial fibrillation labels and output reliable results. Experiments were carried out on a self-built dataset with 200 testers, and the results show that the proposed model performed well in various scenarios. In static scenes, the detection accuracy was 8.1% higher than the optimal comparison algorithm, and the sensitivity was 7.5% higher. In dynamic scenes, where the accuracy of the comparison algorithms was greatly reduced, the accuracy of the proposed model was improved by 16.5%, and the specificity was improved by 18.3%. The model demonstrated good anti-motion interference ability, effectively eliminating the influence of motion interference and improving the detection accuracy of video atrial fibrillation in real scenes.

Figures and Tables | References | Related Articles | Metrics

X-ray image rotating object detection based on improved YOLOv7

CHENG Lang, JING Chao

2023, 44(2): 324-334. DOI: 10.11996/JG.j.2095-302X.2023020324

Abstract ( 527 )

HTML ( 14 )

PDF (3256KB) ( 261 )

For prohibited items in X-ray images, an algorithm for the detection of rotating targets based on the improved YOLOv7 was proposed to address the challenges of accurate identification and localization, as well as the neglection of the directionality of the items. Firstly, an efficient attention network module was integrated into the original network to enhance the ability of the model to extract deep important features. Then, the feature fusion path of the extended efficient long-range attention network (E-ELAN) was improved, and the residual structure jump connection and 1×1 convolution were added between modules, allowing the network to extract richer item features. Finally, to tackle the problem of arbitrary placement direction of prohibited items in X-ray images, the angles were discretized using the dense coded label representation method, thereby improving the positioning accuracy of prohibited items. The experimental results revealed that the improved algorithm could achieve a detection accuracy of 91.2%, 92.6%, and 66.4% on HiXray, OPIXray, and PIDray datasets, respectively. Compared with the original YOLOv7 model, the results were improved by 20.2%, 10.6%, and 15.5%, respectively. The proposed algorithm could provide a valuable technical support for public security by effectively improving the accuracy of prohibited item detection in X-ray images.

Figures and Tables | References | Related Articles | Metrics

Steel surface defect detection based on improved YOLOv5 algorithm

CAO Yi-qin, WU Ming-lin, XU Lu

2023, 44(2): 335-345. DOI: 10.11996/JG.j.2095-302X.2023020335

Abstract ( 886 )

HTML ( 49 )

PDF (5197KB) ( 419 )

An improved YOLOv5 steel surface defects detection algorithm was proposed to address the one-stage detection network YOLOv5, such as inadequate feature extraction ability, limited receptive field, and insufficient feature fusion. A feature pyramid structure of SPP_Res with residual edges was proposed to speed up the training of the model and enhance the feature extraction ability of the model. Additionally, a multi-head self-attention mechanism (C3_MHSA) was added to optimize the network structure, focusing on the global receptive field of the model and extracting richer features of the target. Furthermore, a multi-layer fusion mechanism was introduced to further integrate shallow and deep features, taking into account more information on location, semantics, and details, thereby improving the detection accuracy of steel surface defects. The experimental results demonstrated that the improved YOLOv5 algorithm could exhibit excellent detection performance, and that the mAP on the NEU-DET datasets reached 74.1%, which was 3.4% higher than that of the original YOLOv5 algorithm, 4.0% higher than that of the YOLOX algorithm, 8.6% higher than that of YOLOv3 algorithm, and 23.4% higher than that of the SSD algorithm. The improved YOLOv5 network could detect steel surface defects more accurately than YOLOv5 with similar detection speed, while outperforming other mainstream algorithms in both accuracy and speed.

Figures and Tables | References | Related Articles | Metrics

Research on weed detection in vegetable seedling fields based on the improved YOLOv5 intelligent weeding robot

ZHANG Wei-kang, SUN Hao, CHEN Xin-kai, LI Xu-bing, YAO Li-gang, DONG Hui

2023, 44(2): 346-356. DOI: 10.11996/JG.j.2095-302X.2023020346

Abstract ( 563 )

HTML ( 9 )

PDF (4847KB) ( 292 )

Accurate detection of weeds is a key technology for developing automated weeding equipment. To address the problems of high detection complexity and poor robustness resulting from the complex distribution and variety of weeds, we proposed a weed detection approach for vegetable seedling based on the improved YOLOv5 algorithm and image processing, implemented on a self-developed mobile robot platform. The weed detection complexity was reduced by indirectly detecting weeds through identifying vegetables, thus improving the detection accuracy and robustness. The convolutional block attention module (CBAM) attention module was added to the backbone feature extraction network of the YOLOv5 object detection algorithm to enhance the focus of the network on vegetable targets, and the Transformer module was added to enhance the global information capture capability. The results showed that the average detection accuracy of the improved YOLOv5 algorithm for vegetable targets could reach 95.7%, which was increased by 5.8%, 6.9%, 10.3%, 13.1%, 9.0%, 5.2%, and 3.2% compared with Faster R-CNN, SSD, EfficientDet, RetinaNet, YOLOv3, YOLOv4, and YOLOv5, respectively. The average detection time of the algorithm for a single run was 11 ms, indicating good real-time performance. The method defined green plants outside the vegetable border as weeds, and combined the extreme green (ExG) with the OTSU threshold segmentation method to segment weeds from the soil background. Finally, the weed connectivity domain was marked, followed by outputting the weed plasmids and detection frames. The proposed method could provide a technical reference for automated precision weeding in agriculture.

Figures and Tables | References | Related Articles | Metrics

Research on BIM model of railway track structure based on IFC standard extension

HE Qing, JING Chuan-yu, GAO Tian-ci, WANG Ping

2023, 44(2): 357-367. DOI: 10.11996/JG.j.2095-302X.2023020357

Abstract ( 381 )

HTML ( 4 )

PDF (1289KB) ( 163 )

Building information modeling (BIM) technology plays a pivotal role in enhancing the engineering design level of the railway industry. One major challenge in applying BIM technology in the track design stage is how to make full use of its information modeling ability to realize the digital transmission of design information. To address this challenge, the industrial foundation class (IFC) standard was employed to integrate the track structure engineering information into the BIM model. However, given the lack of entity information in the domain layer of the IFC standard architecture, this paper proposed to extend and define the track field using entity extension and user-defined PESTs, thereby constructing the basic data architecture system of track structure. The entity extension included the extension of spatial structure unit, combination, component, and part, while the user-defined PESTs extended identity information, positioning information, and technical information. The new track entity and attribute set expression system was constructed by extending relational entities and combining them with the IFC standard expression mechanism. On this basis, a modeling method for the BIM model of track structure was further proposed. Finally, through a ballast-less track case, the practicability of the IFC expansion and modeling method was verified, demonstrating its practical engineering significance in improving the completeness and transitivity of BIM model information of track structure in the design stage.

Figures and Tables | References | Related Articles | Metrics

Compliance checking approach for BIM structural model under semantic web

ZHANG Ji-song, YU Ze-han, LI Hai-jiang

2023, 44(2): 368-379. DOI: 10.11996/JG.j.2095-302X.2023020368

Abstract ( 394 )

HTML ( 11 )

PDF (1021KB) ( 131 )

In view of the labor-intensive and low automation of the building information modeling (BIM) compliance checking process, a semantic web-based compliance checking approach for BIM structural model was proposed. The approach included three sub-modules: specification translation, BIM model processing, and compliance checking. The specification translation sub-module can transform semi-structured design specification clauses into structured knowledge, allowing for flexible query and reasoning. The BIM model processing sub-module can realize the extraction, transformation, and mapping of BIM model information. The compliance checking sub-module can implement the rules and generate checking reports. The method started from the way that the first order predicate logic represented the structural design specification. By deploying the ontology construction tool protégé, the BIM structural model information and design specification clauses were translated into the ontology knowledge base through mapping and semantic web rule language (SWRL), thus enabling the query, reasoning, and compliance checking of the BIM structural model. Finally, the frame-structure project verified the effectiveness and feasibility of the approach. This study could provide a reference method for automatic compliance checking of structural design based on BIM models.

Figures and Tables | References | Related Articles | Metrics

Intelligent charging pile design based on SAPAD-AHP

LIU Zong-ming, WANG Ze-qi

2023, 44(2): 380-388. DOI: 10.11996/JG.j.2095-302X.2023020380

Abstract ( 369 )

HTML ( 21 )

PDF (3991KB) ( 162 )

The purpose of this paper is to explore the application of the SAPAD-AHP method in product design, and to propose corresponding research method models and design processes. This could provide theoretical guidance and practical reference for the design of new energy charging piles, and effectively address the problem of urban intelligent charging piles failing to meet the experience needs of users in the context of new infrastructure construction. Taking the owners of new energy automobiles as the research object, the semiotics approach of product architecture design (SAPAD) model was used to study their behavior process. Through clustering analysis at multiple levels of meaning, the cluster in general sense was studied qualitatively. Combined with the weight analysis of the analytic hierarchy process (AHP) method, the multi-level core meaning cluster was obtained quantitatively. Then, through the remapping analysis of the core meaning cluster-product, the core design needs of product modeling and interaction were obtained. By breaking down key user behaviors, studying their real needs and analyzing core design needs, an improved design scheme for intelligent charging piles that could meet users′ needs was proposed. From the perspective of design semiotics, the application of the SAPAD-AHP method to the design of intelligent charging piles could effectively implement the accurate mapping from the analysis of user behavioral needs to the design of product functions. To some extent, this study could address the shortcomings of the existing charging pile design, such as a lack of insight into users′ behaviors and inaccurate grasp of design requirements, and provide a new idea for the research and development of smart charging pile design in China.

Figures and Tables | References | Related Articles | Metrics

Evaluation method of virtual museum interface layout aesthetic with visual cognitive characteristics

PEI Hui-ning, WEN Zhi-qiang, HUANG Xue-qin, TAN Zhao-yun, ZHANG Xin-xin

2023, 44(2): 389-398. DOI: 10.11996/JG.j.2095-302X.2023020389

Abstract ( 310 )

HTML ( 4 )

PDF (920KB) ( 121 )

To address the lack of quantitative research on users′ visual cognitive characteristics in the evaluation of human-machine interface layout, an evaluation method for virtual museums′ interface layout aesthetics was presented, taking into account human visual cognitive characteristics. Firstly, symmetry, density, simplicity, order and dominance were selected as the evaluation indexes, and the abstract rectangular layout image was taken as the sample to evaluate the aesthetic of the interface layout. Secondly, the eye tracking technique was used to test the aesthetic of the interface layout, and the results of the users′ visual cognitive physiological data were obtained. Thirdly, based on the grey theory, an improved grey H-convex correlation model was constructed to quantify the mapping between users′ visual cognitive characteristics and the aesthetic of human-machine interface layout. Finally, the method was applied to the analysis of human-machine interface layout design in virtual museums, the validity of the evaluation method and its ability to judge the positive and negative correlation of the aesthetic index of man-machine layout were verified.

Figures and Tables | References | Related Articles | Metrics

Effects of map symbols and emotional valence on AR map users′ spatial memory

CHEN Yan, ZHANG Qiong-wen, WANG Jia-qi, SHUAI Mei-qiong, HU Jun

2023, 44(2): 399-407. DOI: 10.11996/JG.j.2095-302X.2023020399

Abstract ( 276 )

HTML ( 2 )

PDF (1108KB) ( 90 )

This paper delved into the effects of map symbols′ abstraction level and emotional valence on the spatial memory of mobile augmented reality (AR) map users, and offered suggestions for design optimization. A total of 35 subjects were assessed by the Mental Cutting Test, and then the emotions were induced by the combination of Chinese Affective Picture System and Chinese Affective Music System. The data of 30 subjects′ Profile of Mood States (POMS), Brief Mood Introspection Scale (BMIS), and Egocentric Pointing Judgments were collected and analyzed according to the scale scores and experimental results. The results indicated that the abstraction level of map symbols had a significant impact on spatial memory, with icons having the shortest learning times. However, emotional valence had no significant impact on spatial memory. Furthermore, the abstraction level of map symbols and emotional valence were found to have interactive effects on the learning time and task error rate of spatial memory. The research findings will aid in the design and assessment of comparable systems.

Figures and Tables | References | Related Articles | Metrics

Experimental study on perception of color encoding based on information interface statistical task

GUO Qi, WU Jin-chun, XUE Cheng-qi

2023, 44(2): 408-414. DOI: 10.11996/JG.j.2095-302X.2023020408

Abstract ( 198 )

HTML ( 3 )

PDF (819KB) ( 85 )

In order to effectively enhance users′ decision-making efficiency for color encoding in visual statistics tasks, it is necessary to first investigate the perceptual problem of the human visual system for color encoding in visualization interfaces. Therefore, the concept of significant difference was introduced into the color encoding research, leading to the creation of a set of new color spaces for visual statistics tasks. The effect of color difference threshold on the perceptual performance of visual statistics tasks was then examined through typical visual statistics task experiments, so as to summarize the perceptual accuracy and laws of color coding for visual statistics tasks. The results revealed that: firstly, in the correlation recognition task and the mean comparison task, the perceptual accuracy of the human visual system was more accurate than that predicted by previous color models, indicating that the difference threshold (JND) selection for color encoding could be smaller. Secondly, when different color axes were selected for color encoding, the recognition performance of the correlation recognition task was higher with lightness encoding. Finally, under different basic correlation coefficients and number of point groups, the accuracy of correlation recognition and mean comparison followed the Weber linear function, which reflected the observers′ ability to process information entropy when performing visual statistical tasks. The above findings provide design guidance for information visualization designers, and offer a quantitative basis for color coding of visual interface information from the perspective of psychophysics.

Figures and Tables | References | Related Articles | Metrics

Published as 2, 2023

2023, 44(2): 414.

Abstract ( 73 )

PDF (77547KB) ( 63 )

Table of Contents