Loading...
Welcome to Journal of Graphics share: 

Table of Contents

    For Selected: Toggle Thumbnails
    Cover
    Cover of issue 5, 2024
    2024, 45(5): 1. 
    PDF 77     117
    Related Articles | Metrics
    Contents
    Table of Contents for Issue 5, 2024
    2024, 45(5): 2. 
    PDF 29     50
    Related Articles | Metrics
    Review
    Optimizing the visual effects of 3D rendering in medical imaging: a technical review
    XU Dandan, CUI Yong, ZHANG Shiqian, LIU Yucong, LIN Yusong
    2024, 45(5): 879-891.  DOI: 10.11996/JG.j.2095-302X.2024050879
    HTML    PDF 147     111

    Medical imaging is a critical field in medicine, utilizing techniques such as magnetic resonance imaging (MRI), computed tomography (CT) scans, ultrasound (US), X-rays, and positron emission tomography (PET) scans. These medical images are generated through various imaging techniques. In this context, 3D rendering has emerged as a pivotal visualization tool that plays a significant role in visualizing anatomical structures, enabling accurate diagnosis, effective treatment planning, and precise surgical interventions. This analysis examined the current state of research on optimizing visualization effects in 3D rendering technology within medical imaging. First, two fundamental 3D rendering techniques were introduced, providing a foundation for understanding the technological landscape. Following this, the discussion examined recent advancements in visualization optimization, focusing on two main areas: technical optimization and framework optimization. Technical optimization involved refining algorithms and methods to improve image quality and rendering speed. Framework optimization, on the other hand, focused on the integration of rendering technologies into broader software systems to enhance performance and usability. A comparative analysis of various optimization techniques was presented, highlighting their characteristics, application scenarios, and respective strengths and weaknesses. This comparison served as a reference for researchers and practitioners in selecting appropriate techniques for their specific needs. Evaluating the effectiveness of 3D rendering was another crucial aspect covered in this analysis. Both subjective and objective evaluation methods were explored to provide a comprehensive assessment of visualization quality. The analysis also discussed potential challenges posed by technological advancements, such as increased algorithm complexity, decreased rendering efficiency, and suboptimal real-time performance, as well as exploring potential solutions. Future directions for optimizing 3D rendering and visualization effects were explored.

    Figures and Tables | References | Related Articles | Metrics
    Image Processing and Computer Vision
    A refined YOLOv8-based algorithm for lightweight pavement disease detection
    HU Fengkuo, YE Lan, TAN Xianfeng, ZHANG Qinzhan, HU Zhixin, FANG Qing, WANG Lei, MAN Xiaofeng
    2024, 45(5): 892-900.  DOI: 10.11996/JG.j.2095-302X.2024050892
    HTML    PDF 211     219

    Road surface defect detection is a crucial task for repairing road damage and ensuring driving safety. To address the issues of low detection accuracy, high costs, large model parameters, and the difficulty in applying existing road surface defect detection algorithms to mobile terminal devices, a lightweight detection algorithm, YOLOv8n-GSBP, based on the improved YOLOv8n model, was proposed. Firstly, the C2f-GhostNetv2 module was introduced into the backbone network to maintain detection accuracy while achieving model lightweight. Additionally, the SimAM module was added after the SPPF module to enhance the network’s ability to extract road surface defect features and distinguish them from background environmental features. Secondly, the neck network was replaced with the BiFPN structure, and the model’s multi-scale feature fusion capability was enhanced while addressing significant differences in road surface defect scales to improve precision and robustness. Finally, the head was improved by the parameter-sharing principle, and the spatial channel reconstruction convolutional module SCConv was introduced to achieve lightweight improvement of the detection head while reducing model parameters and computational complexity. The experimental results on the RDD2022 dataset showed that the mAP50 of YOLOv8n-GSBP road surface disease detection method was 0.3% higher than that of the YOLOv8n; however, the parameters were reduced by 55.6% and the computational complexity was reduced to 36.7%. Furthermore, through comparisons with other mainstream object detection algorithms, we further validated both effectiveness and superiority of our proposed algorithm.

    Figures and Tables | References | Related Articles | Metrics
    TFD-YOLOv8: a transmission line foreign object detection method
    WANG Yaru, FENG Lilong, SONG Xiaoke, QU Zhuo, YANG Ke, WANG Qianming, ZHAI Yongjie
    2024, 45(5): 901-912.  DOI: 10.11996/JG.j.2095-302X.2024050901
    HTML    PDF 161     177

    Foreign object detection based on UAV aerial images is an important aspect for intelligent inspection of transmission lines. The YOLO target detection algorithm is high in accuracy and speed, making it the current mainstream algorithm. However, when carrying out transmission line foreign object detection, due to the variable scale and insignificant features of foreign object targets, problems such as misdetection and omission detection would arise. A YOLOv8 model (transmission line foreign detection-YOLOv8, TFD-YOLOv8) was proposed for transmission line foreign object detection. A two-branch downsampling module was constructed in the YOLOv8s neck network to intercept the scale-related detail information easily lost during the downsampling process, achieving the efficient fusion of semantic and detail information and improving the information consistency of feature maps at different scales. Then, a mix-enhancement attention module was inserted into the backbone network to simultaneously extract global and local features of the image, generating spatial attention and channel attention, respectively, and resulting in a mix-enhancement attention including local, global, spatial, and channel information. This enhanced the network’s ability to capture the key features of the targe. The experimental results showed that compared with the baseline model, the proposed method improved the average detection accuracy by 6.7%, and the accuracy and recall by 12.9% and 5.1%, respectively. This method demonstrated advantages in terms of detection accuracy and complexity compared with several existing target detection methods.

    Figures and Tables | References | Related Articles | Metrics
    Photovoltaic cell surface defect detection based on DBBR-YOLO
    LIU Yiyan, HAO Tingnan, HE Chen, CHANG Yingjie
    2024, 45(5): 913-921.  DOI: 10.11996/JG.j.2095-302X.2024050913
    HTML    PDF 126     81

    A method for detecting surface defects of photovoltaic (PV) cells based on DBBR-YOLO was proposed to address the difficulties in defect feature extraction and the issues of real-time detection and accuracy. Firstly, a diverse branch block (DBB) was incorporated into the C2f module of the YOLOv8n Backbone section to introduce diversified feature extraction paths, enhancing the capability of feature extraction. Secondly, the Neck section of the model was fused with Gold-YOLO to achieve global information aggregation and feature fusion at different hierarchical levels, improving the efficiency of information interaction between feature maps and enhancing the feature expression capability of the model. Finally, the SimAM attention mechanism was introduced to improve the feature expression capability, thereby enhancing the model’s ability to detect small defects or targets. Experiments conducted on five types of PV cell surface defects demonstrated that the improved DBBR-YOLO model achieved an mAP50 value of 93.1%, a 3.7% improvement over YOLOv8n, with an FPS value of 158.0. The performance of the model in terms of accuracy and speed can meet the requirements for real-time detection and accuracy, making it suitable for practical application scenarios of detecting PV cell surface defects.

    Figures and Tables | References | Related Articles | Metrics
    Video anomaly detection based on attention feature fusion
    WU Peichen, YUAN Lining, HU Hao, LIU Zhao, GUO Fang
    2024, 45(5): 922-929.  DOI: 10.11996/JG.j.2095-302X.2024050922
    HTML    PDF 90     86

    Currently, feature fusion methods based on attention mechanisms, such as multi-head self-attention, largely rely on the correlation between features, with limited cross-domain fusion capabilities. Additionally, due to existing domain differences among various features, the spatiotemporal perception capability of the fused features is insufficient. To address the insufficient cross-domain expression capability of RGB and optical flow features and the weak spatiotemporal perception capability of the fused features, a video anomaly detection method based on attentional feature fusion was proposed. Firstly, a lightweight attentional feature fusion module (LAFF) was employed to construct the fusion mechanism, combining RGB and optical flow features, enhancing the feature expression capabilities while reducing the network’s parameter count and improving anomaly detection algorithm performance. Then, in the global spatiotemporal perception stage, a diverse branch block (DBB) was utilized to enhance the spatiotemporal perception capabilities of the features, while considering computational complexity and detection effectiveness. Finally, the proposed method achieved a recognition rate of 99.85% on the UCSD Ped2 dataset and demonstrated similarly strong performance on the CUHK Avenue and LAD 2000 datasets, validating the effectiveness of the approach.

    Figures and Tables | References | Related Articles | Metrics
    The vehicle parts detection method enhanced with Transformer integration
    ZHAI Yongjie, LI Jiawei, CHEN Nianhao, WANG Qianming, WANG Xinying
    2024, 45(5): 930-940.  DOI: 10.11996/JG.j.2095-302X.2024050930
    HTML    PDF 52     58

    To effectively address issues such as false detections and missed detections caused by insufficient feature extraction and inadequate utilization of candidate boxes in vehicle component detection models, an improved Transformer-based method for vehicle component detection was proposed. Firstly, by combining multi-head self-attention and bi-layer routing attention, a key region multi-head self-attention (KR-MHSA) mechanism was introduced. Secondly, the final layer of ResNet in the baseline model (Mask R-CNN) was integrated with KR-MHSA using residual fusion, enhancing the basic feature extraction capabilities of the model. Finally, the improved Swin Transformer was employed for feature learning on the candidate boxes generated by the model, enabling the model to better understand the differences and similarities between various candidate boxes. Experiments conducted on a constructed dataset of 59 vehicle component categories demonstrated that the proposed model outperformed other state-of-the-art instance segmentation models in both detection and segmentation performance. Compared to the baseline model, the detection accuracy improved by 4.47%, and the segmentation accuracy improved by 4.4%. This effectively resolved the issues of insufficient feature extraction and inadequate utilization of candidate boxes in vehicle component detection, leading to more accurate and efficient replacement of damaged parts by insurance companies, thus improving claims processing efficiency.

    Figures and Tables | References | Related Articles | Metrics
    Research on multi-scale remote sensing image change detection using Swin Transformer
    LIU Li, ZHANG Qifan, BAI Yuang, HUANG Kaiye
    2024, 45(5): 941-956.  DOI: 10.11996/JG.j.2095-302X.2024050941
    HTML    PDF 90     78

    Due to the complexity of terrain information and the diversity of change detection data, it is difficult to ensure the adequacy and effectiveness of feature extraction in remote sensing images, resulting in low reliability of detection results obtained by change detection methods. Although convolutional neural networks are widely applied in remote sensing change detection due to their advantage of effectively extracting semantic features, the inherent locality of convolutional operations limits the receptive field, making it difficult to capture global spatiotemporal information, thus limiting the modeling of long-range dependencies in the feature space. To capture long-distance semantic dependencies and extract deep global semantic features, a multi-scale feature fusion network SwinChangeNet based on the Swin Transformer was designed. Firstly, SwinChangeNet utilized a twin multi-level Swin Transformer feature encoder for long-range context modeling. Secondly, a feature difference extraction module was introduced into the encoder to calculate the multi-level feature differences before and after changes at different scales, and then the multi-scale feature maps were fused through an adaptive fusion layer. Finally, residual connections and channel attention mechanisms were introduced to decode the fused feature information, thereby generating a complete and accurate change map. Compared with seven classic and cutting-edge change detection methods on two publicly available datasets, CDD and CD-Data_GZ, the proposed model demonstrated the best performance in both datasets. In the CDD dataset, compared with the second-best performing model, the F1 score increased by 1.11% and the accuracy by 2.38%. The proposed model outperformed the others in the CD-Data_GZ dataset. Compared to the second best-performing model, the F1 score, accuracy, and recall increased by 4.78%, 4.32%, and 4.09%, respectively, showing significant improvements. The comparative experimental results demonstrated that the proposed model has superior detection performance. The stability and effectiveness of each improved module in the model were also validated through the ablation experiment. In conclusion, the model proposed in this article focused on the task of remote sensing image change detection, introducing the Swin Transformer structure. This enabled the network to more effectively encode local and global features of remote sensing images, resulting in more accurate detection results, while ensuring that the network converges efficiently on datasets with a wide variety of land features.

    Figures and Tables | References | Related Articles | Metrics
    Fusing prior knowledge reasoning for surface defect detection
    JIANG Xiaoheng, DUAN Jinzhong, LU Yang, CUI Lisha, XU Mingliang
    2024, 45(5): 957-967.  DOI: 10.11996/JG.j.2095-302X.2024050957
    HTML    PDF 69     61

    Current surface defect detection methods based on deep learning mainly focus on the individual identification of defect instances, considering defect detection only from the aspect of region features. However, this method overlooks the high-level relation between defects, which will inevitably lead to defect detection errors. To address the above problems, a surface defect detection network (PKR-Net) based on prior knowledge reasoning was proposed. Specifically, PKR-Net mainly consists of two parts, namely, the explicit knowledge reasoning module (EKRM) and the implicit knowledge reasoning module (IKRM). EKRM constructed an explicit relation graph (ERG) to capture the global co-occurrence relation between defects in the dataset, thereby obtaining co-occurrence relation features. Meanwhile, IKRM constructed an implicit relation graph (IRG) to capture the local spatial relation between defects in the image, thereby obtaining spatial relation features. Finally, the co-occurrence relation features and spatial relation features were fused and re-fed into the classification and regression layers to improve detection performance. Experimental verification was conducted on the industrial defect datasets Textile, NEU-DET and GC10-DET. The experimental results showed that the mAP of the proposed network model improved by 14.8%, 8.2%, and 18.9%, respectively, compared with the baseline model Faster RCNN. Compared with other defect detection models, the proposed model can achieve better detection performance, verifying its effectiveness.

    Figures and Tables | References | Related Articles | Metrics
    Feature fusion and inter-layer transmission: an improved object detection method based on Anchor DETR
    ZHANG Dongping, WEI Yangyue, HE Shuji, XU Yunchao, HU Haimiao, HUANG Wenjun
    2024, 45(5): 968-978.  DOI: 10.11996/JG.j.2095-302X.2024050968
    HTML    PDF 44     41

    Object detection is a crucial task in the field of computer vision, aiming to accurately identify and locate objects of interest in images or videos. An improved object detection algorithm was proposed by incorporating feature fusion, optimizing the inter-layer transmission method of the encoder, and designing a random jump retention method. These improvements addressed the limitations of general Transformer models in object detection tasks. Specifically, to counteract the issue of insufficient object information perception due to the computational constraints limiting Transformer vision models to a single layer of features, a convolutional attention mechanism was utilized to achieve effective multi-scale feature fusion, thereby enhancing the capability of object recognition and localization. By optimizing the transfer mode between encoder layers, each encoder layer effectively transmitted and learned more information, reducing information loss between layers. Additionally, to address the problem where predictions in the intermediate stages of the decoder outperformed those in the final stage, a random jump retention method was designed to improve the model’s prediction accuracy and stability. Experimental results demonstrated that the improved method significantly enhanced performance in object detection tasks. On the COCO2017 dataset, the model’s AP reached 42.3%, and the AP for small targets improved by 2.2%; on the PASCAL VOC2007 dataset, the model’s AP improved by 1.4%, and the AP for small targets improved by 2.4%.

    Figures and Tables | References | Related Articles | Metrics
    Research on defect detection of transmission line fittings based on improved YOLOv8 and semantic knowledge fusion
    LI Gang, CAI Zehao, SUN Huaxun, ZHAO Zhenbing
    2024, 45(5): 979-986.  DOI: 10.11996/JG.j.2095-302X.2024050979
    HTML    PDF 60     54

    To address issues such as the uneven distribution of defect samples among classes and the difficulty in extracting tiny features of defects in the task of detecting defects of transmission line bolts and fixtures, a defect detection method for transmission line bolts and fixtures was proposed based on the improvement of YOLOv8 and semantic knowledge fusion. First, the semantic correlation was established by analyzing the relationship between the defective types of bolt fittings in the data samples and the types of fittings carried by that bolt. Then, the BiFusion and RepBlock modules were introduced into the Neck part of the YOLOv8 model to enhance its feature extraction capability. Second, the Loss function of the weights was corrected using an improved fusion of semantic knowledge, further improving the accuracy of the training model and reducing the occurrence of misdetection. Finally, baseline selection experiments, ablation experiments, hyper-parameter experiments, and comparative experiments were conducted, respectively. The experimental results showed that compared with the Baseline model, the improved YOLOv8 method increased the mean average precision (mAP) by 4.0% and improved the accuracy of the key less sample classes by 24.6%, effectively enhancing the defect detection performance for transmission line bolted fittings. The proposed semantic correlation establishment and semantic knowledge fusion method also demonstrated a certain degree of generalizability, providing new methodological support for UAV-based intelligent inspection of transmission lines.

    Figures and Tables | References | Related Articles | Metrics
    Road defect detection algorithm based on improved YOLOv7-tiny
    XIE Guobo, LIN Songze, LIN Zhiyi, WU Chenfeng, LIANG Lihui
    2024, 45(5): 987-997.  DOI: 10.11996/JG.j.2095-302X.2024050987
    HTML    PDF 79     55

    To address the current issues in road damage detection methods, such as large parameter sizes, poor performance in detecting small targets, and high rates of false positives and missed detections, an improved YOLOv7-tiny-based road defect detection algorithm was proposed. The ELAN-SimAM-D structure was designed by introducing depthwise separable convolution (DSC) and a parameter-free attention mechanism, which could reduce computational and parameter sizes to achieve a lightweight model while enhancing the model’s feature extraction and fusion capabilities. The SPPAda structure, which incorporated adaptive exponential pooling and adaptive fusion, was introduced as a spatial pyramid pooling structure to enhance the retention of road defect information and improve detection accuracy. A new P2 small target network layer was added to strengthen the detection capability for smaller target defects, improving detection accuracy. A new loss function, NWD-EIOU, was designed to replace the original CIOU loss function, improving the localization accuracy for small targets. Experimental results showed that compared to the original YOLOv7-tiny algorithm, the improved YOLOv7-tiny algorithm achieved an mAP@0.5 of 83.14% on a self-built experimental dataset, an increase of 3.50%, with a 4.96% improvement in recall rate, and a 33.84% reduction in the model’s parameter size, meeting the requirements for road defect detection.

    Figures and Tables | References | Related Articles | Metrics
    Visually meaningful image encryption based on twin physically unclonable functions and compressed sensing
    GAO Xianwei, GUO Weikai, CHENG Yixuan, YUAN Ye, SUO Zhufeng
    2024, 45(5): 998-1007.  DOI: 10.11996/JG.j.2095-302X.2024050998
    HTML    PDF 43     33

    The visually meaningful image encryption (VMIE) scheme has attracted the attention from researchers due to its ability to conceal encrypted images in plaintext images without raising suspicion from attackers. In order to enhance the performance of the system in terms of capacity and security, many VMIE schemes based on compressed sensing have been proposed. However, it remains difficult to solve the key management problem effectively in these schemes. Therefore, a VMIE scheme based on twin physically unclonable functions and compressed sensing was proposed. First, the complete key seed was embedded in the carrier image to achieve public network key exchange while avoiding additional communication consumption. With the key seed being encrypted and encoded, both security and robustness were guaranteed. Subsequently, the key seed was iteratively extended using a hash algorithm to generate the key stream. The experimental results demonstrated that the scheme can achieve the balance of security, embedding capacity, and robustness.

    Figures and Tables | References | Related Articles | Metrics
    Computer Graphics and Virtual Reality
    Multi-view stereo network reconstruction with feature alignment and context-guided
    XIONG Chao, WANG Yunyan, LUO Yuhao
    2024, 45(5): 1008-1016.  DOI: 10.11996/JG.j.2095-302X.2024051008
    HTML    PDF 75     126

    To address the problem of poor reconstruction of small features and edge areas in 3D reconstruction, a multi-view 3D reconstruction network based on feature alignment and context-guided was proposed, namely AGA-MVSNet (alignment and context guidance MVSNet). First, a feature alignment module (FA) and a feature selection module (FS) were constructed to combine different levels of the feature pyramid. The features were first aligned and then fused to enhance the feature extraction capabilities of small-sized objects and edge areas. Subsequently, a context guidance module was incorporated into the cost volume regularization to fully utilize surrounding information and solve the problem of poor correlation between cost volumes, thereby improving the accuracy and completeness of three-dimensional reconstruction, with only a slight increase in memory consumption. Finally, experiments were conducted on the DTU dataset. Experimental results demonstrated that the proposed method improved the accuracy by 2.2% and the overall reconstruction quality by 2.5% compared with the benchmark network CasMVSNet. In addition, the performance on the Tanks and Temples dataset was also excellent compared with some known methods, and good point cloud effects were also generated on the BlendedMVS dataset.

    Figures and Tables | References | Related Articles | Metrics
    A short chromosome classification method based on spatial attention and texture enhancement
    PENG Wen, LIN Jinwei
    2024, 45(5): 1017-1029.  DOI: 10.11996/JG.j.2095-302X.2024051017
    HTML    PDF 43     43

    Chromosome classification is a crucial task in karyotype analysis. Despite the significant achievements made by residual neural networks in chromosome classification, the classification still presents challenges due to the short length, difficult-to-identify classification features, and high morphological similarity of certain chromosomes. To address this issue, a spatial information attention and texture enhancement network (SIATE-Net) model was proposed for chromosome classification. The SIATE-Net model utilized the Inception_ResNetV2 model as its backbone network to extract deep features of chromosomes. By introducing self-attention mechanisms and depth-wise separable convolution, the model could better focus on and retain the spatial information of short chromosomes. The short length of certain chromosomes often leads to confusion in banding information. To mitigate this issue, the model integrated a texture enhancement mechanism to amplify the differences between chromosomes, providing the model with more discriminative features for classification. The SIATE-Net model was validated on both private and public datasets, demonstrating superior classification performance compared to other methods, especially in classifying short chromosomes. On the private dataset, the SIATE-Net model achieved the best overall classification accuracy of 98.05%, with a high accuracy of 97.42% for short chromosomes. On the public dataset, the overall classification accuracy of the SIATE-Net model was 98.95%, with short chromosomes reaching an accuracy of 98.51%. Experimental results demonstrated that the targeted self-attention module, depth-wise separable convolution, and texture enhancement module effectively addressed the classification of short chromosomes without compromising overall classification accuracy.

    Figures and Tables | References | Related Articles | Metrics
    A free viewpoint synthesis method based on differentiable rendering
    ZHU Jie, SONG Ying
    2024, 45(5): 1030-1039.  DOI: 10.11996/JG.j.2095-302X.2024051030
    HTML    PDF 35     45

    To address the challenges posed by highly variable lighting conditions and camera parameters in uncontrolled environments affecting free viewpoint synthesis, an approximate differentiable deferred inverse rendering pipeline (ADDIRP) was proposed. This pipeline incorporated a physics-based camera model to accurately simulate the optical imaging process of the camera. Firstly, we proposed creating photometric and geometric camera models based on the input images and corresponding poses. The photometric camera model was represented by learnable parameters such as exposure and white balance, while the geometric camera model was represented by learnable intrinsic and extrinsic parameters. Next, the components of the pipeline were optimized using image space loss between the rendered and target images, enhancing the robustness of the inverse rendering pipeline to complex lighting and roughly captured images. Finally, our approach generated 3D content reconstructions compatible with traditional graphics engines. Experimental results demonstrated that the ADDIRP outperformed existing methods on real-world datasets, achieving superior visual perception consistency on synthetic datasets while maintaining comparable synthesis quality.

    Figures and Tables | References | Related Articles | Metrics
    BIM/CIM
    Small-target worker detection on construction sites
    LI Jianhua, HAN Yu, SHI Kaiming, ZHANG Kejia, GUO Hongling, FANG Dongping, CAO Jiaming
    2024, 45(5): 1040-1049.  DOI: 10.11996/JG.j.2095-302X.2024051040
    HTML    PDF 62     58

    The accurate detection of workers using construction site surveillance videos or images can support the intelligent management of construction safety. However, the long-distance surveillance causes workers to appear as small targets in images, and the complex and changing environment further complicates worker detection. To address this problem, a small-target worker detection method integrating an improved YOLO model with the frame difference method was proposed. On the one hand, the stationary workers were detected through the improved YOLOv5 model, which introduced slicing aided inference to obtain the clearer features of small-target workers, added small-target detection heads to ensure feature completeness, and employed the ECA mechanism to improve the detection performance. On the other hand, the moving workers with weak image features were detected using the frame difference method, to some extent compensating for the shortcomings of image detection. The proposed method was validated on a self-constructed dataset, and the results showed that the F1-Score and mAP50 of the improved YOLOv5 model improved by 11.3% and 12.5%, respectively. Additionally, the detection rate of small-target workers using the integrated method increased by 3.6% to 84.2%, with a FPS of 6 frames·per second. Thus, the proposed method can better satisfy the needs of worker detection on construction sites.

    Figures and Tables | References | Related Articles | Metrics
    Research on efficient detection model of tunnel lining crack based on DCNv2 and Transformer Decoder
    SUN Jilong, LIU Yong, ZHOU Liwei, LU Xin, HOU Xiaolong, WANG Yaqiong, WANG Zhifeng
    2024, 45(5): 1050-1061.  DOI: 10.11996/JG.j.2095-302X.2024051050
    HTML    PDF 84     75

    To address the problems of low recognition accuracy, slow detection speed, and large parameter quantities caused by the random and dense distribution of cracks in tunnel linings and low resolution of annotation boxes in existing models, the YOLOv8 network framework was improved based on the Deformable Convolution Network version 2 (DCNv2) and end-to-end Transformer Decoder to propose a lining crack detection model DTD-YOLOv8. Firstly, DCNv2 was added to fuse the YOLOv8 backbone convolutional network C2f, enabling the model to accurately and quickly perceive crack deformation features. At the same time, the Transformer Decoder replaced the YOLOv8 detection head to achieve a complete object detection process within an end-to-end framework, thereby eliminating the computational consumption caused by the Anchor-free processing mode. A self-built crack dataset was used to compare and verify seven detection models, including SSD, Faster-RCNN, RT-DETR, YOLOv3, YOLOv5, YOLOv8, and DTD-YOLOv8. The results indicated that the F1 score and mAP@50 of DTD-YOLOv8 reached 87.05% and 89.58%, respectively. Compared to the other six models, the F1 score increased by 14.16%, 7.68%, 1.55%, 41.36%, 8.20%, and 7.40%, while the mAP@50 increased by 28.84%, 15.47%, 1.33%, 47.65%, 10.14%, and 10.84%. The parameter count of the new model was only one-third of RT-DETR, and the detection speed for a single image was 16.01 ms, with an FPS of 65.46 frames per second, demonstrating a speed improvement over other comparative model. The DTD-YOLOv8 model can demonstrate efficient performance in meeting the requirements of crack detection tasks in operational tunnels.

    Figures and Tables | References | Related Articles | Metrics
    Digital Design and Manufacture
    A knowledge graph-based sequence planning method for helicopter components assembly
    JIANG Mingjie, ZHANG Weicai, RONG Haoming, ZHANG Junqi, HUANG Shaohua
    2024, 45(5): 1062-1070.  DOI: 10.11996/JG.j.2095-302X.2024051062
    HTML    PDF 69     68

    Due to the large number of helicopter components and the complex constraint relationships between them, traditional assembly sequence planning methods encounter the problems of exponential explosion and local optimal solution. To address the difficulties posed by the exponential increase of running time with the number of components and complex calculations of multi-constraint relationship matrix, knowledge graphs (KG) were introduced to establish intuitive semantic assembly information models. A KG -based method for planning the assembly sequence of helicopter components was proposed. Firstly, key assembly information such as structure information and constraint relationships was extracted based on 3D model analysis and knowledge reasoning. Secondly, an assembly information model in the form of KG was constructed based on ontology. Finally, a graph planning algorithm with feedback was employed to determine the assembly sequence of helicopter components from the KG. The priority relationships provided in the KG reduced the search space of the algorithm. Under the constraint of these priority relationships, the graph planning algorithm with feedback aimed to minimize the number of assembly direction changes and the number of assembly tool changes. It gradually planned the sequence, and fed back the planning results to avoid repeated searches. The middle piece of the helicopter mid-fuselage was used as an experimental object to verify the effectiveness of the proposed method. The proposed method achieved higher fitness for the assembly sequence and shorter solution times compared to heuristic algorithms.

    Figures and Tables | References | Related Articles | Metrics
    Industrial Design
    Research on design-driven product disruptive innovation methods
    YANG Pei, SONG Jiong, YANG Dongmei, BAI Renfei, CAO Guozhong
    2024, 45(5): 1071-1083.  DOI: 10.11996/JG.j.2095-302X.2024051071
    HTML    PDF 44     60

    Based on theoretical studies in design-driven innovation and disruptive innovation, a design-driven disruptive innovation model was proposed, thereby enhancing research on disruptive innovation methods in design. This model comprised three stages: meaning discovery, functional system adjustment, and product language expression. Firstly, based on SET, market trends were envisioned for new product meanings, and internal critiques were utilized to explore feasible product meanings. Then, research on target users utilized AD to map functional requirements into design parameters, which were compared with the original product technology to adjust and achieve disruptive innovation solutions. The new product meaning features and the parts corresponding to the improved and added technologies were identified for product meaning interpretation, and samples of these parts were selected using analogy reasoning. Finally, similarity analysis was conducted to determine the design prototype and extract its elements. Extenics was employed to optimize the product language expression, resulting in a disruptive innovation product language solution. The electric wheelchair design example validated the model’s feasibility, providing a reference for other design-driven disruptive innovations.

    Figures and Tables | References | Related Articles | Metrics
    Design of knee joint variable stiffness protector based on lower limb biomechanical characteristics
    TAN Kun, WANG Xupeng, ZHAO Jiaxin, HUANG Yuzhe, LI Xu, LI Jiachen
    2024, 45(5): 1084-1095.  DOI: 10.11996/JG.j.2095-302X.2024051084
    HTML    PDF 50     57

    In order to meet the growing demand for wearable sports protective gear products in the context of national fitness movement and to address the problems of insufficient protective performance and low matching degree of sportswear in existing prefabricated protective gears, a new method was developed to analyze the skin deformation and curvature pressure distribution of the lower limbs during movement. First of all, key technologies and methods such as reverse engineering and clothing pressure calculation theory were employed to collect and qualitatively and quantitatively analyze key parameters like skin deformation and protective gear pressure during movement. This analysis revealed the regional pattern changes in lower limb skin deformation and curvature pressure distribution, leading to the construction of a functional zoning model for human knee joint protective gear. Secondly, the design of knee joint variable stiffness brace partition structure was completed according to different partition regions. The finite element simulation method was then used to construct a finite element model of the knee joint-guard, allowing for the comparative analysis of contact pressure distribution laws under various wearing conditions of multiple groups of guards. Finally, combined with the pressure and electromyography acquisition experiments of the guards, it was verified that the knee variable stiffness guards can optimize the pressure distribution of the knee joint, improve the wearing degree of fit, and enhance the athletic performance.

    Figures and Tables | References | Related Articles | Metrics
    Research on automatic generation and application of Ruyuan Yao embroidery based on self-attention mechanism
    LIU Zongming, HONG Wei, LONG Rui, ZHU Yue, ZHANG Xiaoyu
    2024, 45(5): 1096-1105.  DOI: 10.11996/JG.j.2095-302X.2024051096
    HTML    PDF 38     36

    To address the limitations of current style migration models in processing Ruyuan Yao embroidery images, especially in effectively handling abstract geometric transformations and the high noise in the generated images, a style migration model for Ruyuan Yao embroidery named SA-CycleGAN was proposed. By incorporating a self-attention mechanism and replacing the objective function for generating the adversarial loss with WGAN, the model significantly enhanced its ability to capture the style features of Ruyuan Yao embroidery, thereby optimizing the quality of style mapping. In terms of application, the proposed SA-CycleGAN model not only provided solid technical support for the automatic generation and online design system of Ruyuan Yao embroidery patterns, but also facilitated the construction of the corresponding database and digital sharing platform. Rigorous comparative experiments demonstrated that the optimized SA-CycleGAN model achieved excellent performance in the evaluation indexes for Ruyuan Yao embroidery pattern factors, its FID value was reduced by 16.1%, and the IS value was relatively improved by 13.2% compared with the original CycleGAN model, resulting in significantly improved image quality that was visually closer to the original Ruyuan Yao embroidery style. The establishment of the pattern design system of Ruyuan Yao embroidery greatly enhanced the design efficiency, injecting new vigor and value into the preservation and innovation of the ethnic group patterns.

    Figures and Tables | References | Related Articles | Metrics
    Research on evaluation method of coupling degree of interior and exterior styling images of smart vehicle
    ZHAO Fanghua, ZHANG Hangyuan, DING Man, LI Wenhua, PEI Huining
    2024, 45(5): 1106-1116.  DOI: 10.11996/JG.j.2095-302X.2024051106
    HTML    PDF 89     48

    To enhance designers’ control over the overall image conveyed by interior and exterior decoration in the early stage of intelligent vehicle modeling design, an evaluation method for assessing the coupling degree of interior and exterior decoration modeling was proposed, based on eye movement experiments and Kansei engineering theory. By employing a coupling theory approach, investigations were conducted to obtain perceptual image vocabularies and co-occurrence galleries for both interior and exterior decoration. Observation distance was introduced as an adjustment factor to standardize the image evaluation of interior and exterior decoration. The semantic differential method was employed to construct an overall image space for interior and exterior decoration, enabling us to derive an image transition line. Eye movement experiments were conducted to identify key design elements and their objective cognitive weights for both car interiors and exteriors. Following expert review and correction, comprehensive weights were calculated, which were then combined with image scores. This allowed us to explore an evaluation method based on cognitive coupling of smart car interiors and exteriors. The proposed model was applied to evaluate three smart cars, with results compared to scores obtained through semantic differential analysis. Numerical findings demonstrated that our proposed method could achieve high accuracy while effectively guiding subsequent coupling designs of smart car interiors and exteriors.

    Figures and Tables | References | Related Articles | Metrics
    Published as
    Published as 5, 2024
    2024, 45(5): 1117. 
    PDF 19     1813
    Related Articles | Metrics