Journal of Graphics

Cover of issue 3, 2023

2023, 44(3): 0.

Abstract ( 124 )

PDF (1667KB) ( 138 )

Related Articles | Metrics

Table of Contents for Issue 3, 2023

2023, 44(3): 1.

Abstract ( 65 )

PDF (233KB) ( 66 )

Related Articles | Metrics

Research progress of pattern classification based on machine learning

BIAN Kun, LIANG Hui

2023, 44(3): 415-426. DOI: 10.11996/JG.j.2095-302X.2023030415

Abstract ( 357 )

HTML ( 13 )

PDF (1779KB) ( 161 )

This paper presented a review of the research on pattern classification combined with machine learning at home and abroad. The study systematically sorted out the research methods, literature data analysis, pattern classification and machine learning applications, aiming to understand the current research status and research progress at home and abroad. This paper also summarized the methods and shortcomings of pattern classification and looked forward to the future development, providing reference for further exploration. Based on a large number of literature studies, CiteSpace software was employed to analyze the current research hotspots and trends. The methods, classical models, and the application of machine learning in pattern classification used in dataset construction, data processing, feature extraction, and pattern classification were analyzed and summarized in detail. The pattern classification research had a tendency of evolving from traditional manual classification to machine classification. The accurate and efficient acquisition of target pattern features can effectively improve the classification effect. There is a lack of databases in pattern classification research, which hinders the in-depth study of pattern classification. The systematic pattern database can be constructed by intelligent classification methods. The classification technology can be applied to the service system platform to realize the living inheritance of patterns. On the basis of classification, combined with various combination algorithms, innovative design of patterns can be carried out to promote the further development of patterns.

Figures and Tables | References | Related Articles | Metrics

Surface defect detection of threaded steel based on improved YOLOv5

HU Xin, ZHOU Yun-qiang, XIAO Jian, YANG Jie

2023, 44(3): 427-437. DOI: 10.11996/JG.j.2095-302X.2023030427

Abstract ( 509 )

HTML ( 20 )

PDF (18068KB) ( 263 )

An improved YOLOv5 algorithm for surface defect detection was proposed to solve the problems of low detection accuracy, high missed detection, and false detection rate in industrial scenarios. The improved YOLOv5 algorithm incorporated the multi-space pyramid pooling module (M-SPP) to optimize the network and the detection accuracy could be improved to a certain extent by increasing the depth of the network for better feature extraction. The improved spatial and coordinate attention module (SCA) was introduced to further distinguish the weight relationship between different pixels in the spatial domain, put more emphasis on the region of interest. This algorithm reduced the unnecessary regional weight and enhanced the model’s attention to small target defects. The double sampling transition module (TB) was utilized for downsampling to reduce the loss of important features and obtain more feature information. The k-means ++ algorithm was also employed to reunite the class anchor frame, and the generated preset anchor frame was more suitable for different sizes of defects, thereby improving the detection accuracy of the algorithm. The experimental results on the surface defect dataset of spiral steel showed that the improved YOLOv5 algorithm achieved good detection performance for the surface defect detection of spiral steel, superior to other compared algorithms. The improved YOLOv5 algorithm achieved an AP₅₀ of 97.6%, 3.2% higher than the YOLOv5 algorithm, and all other indexes showed an increase. While maintaining the original detection speed, the algorithm could accurately detect the surface defects of steel rebar.

Figures and Tables | References | Related Articles | Metrics

Defect detection method of transmission line bolts based on DETR and prior knowledge fusion

LI Gang, ZHANG Yun-tao, WANG Wen-kai, ZHANG Dong-yang

2023, 44(3): 438-447. DOI: 10.11996/JG.j.2095-302X.2023030438

Abstract ( 307 )

HTML ( 11 )

PDF (27048KB) ( 129 )

In order to address the problem of deep learning model unable to learn the prior knowledge of bolt targets, difficulty in locating its defects quickly and accurately only through visual features, and the limited number and unbalanced categories of bolt defect samples, this paper proposed the method of incorporating the deep learning model and the prior knowledge of bolts. DETR (detection transformer) was selected as the baseline model, and an improved DETR model was designed and implemented by incorporating DETR and prior knowledge. First, the visual-knowledge attention module was used to fuse the visual features of the bolt image with the prior knowledge of the bolt, generating the enhanced visual features corresponding to the bolts. Then, the enhanced visual features were sent to the DETR model framework, which was based on the Transformer encoding-decoding structure, thus identifying and classifying bolt targets. Finally, to overcome the problem of few and unbalanced samples of bolt critical defects, a class incremental learning loss function (CILLF) was introduced to enhance the identification ability of the model and alleviate the long-tail distribution problem of bolt defect samples. The simulation results demonstrated that the improved DETR model achieved an increase of 2.8 percentage points in mAP on the transmission line bolt defect sample compared with the baseline model DETR. Compared with the mainstream Faster R-CNN and YOLOv5l models, the improved DETR model showed significant improvement in detecting category images with few bolt defect samples under the long-tail distribution.

Figures and Tables | References | Related Articles | Metrics

Improved substation instrument target detection method for YOLOv5 algorithm

MAO Ai-kun, LIU Xin-ming, CHEN Wen-zhuang, SONG Shao-lou

2023, 44(3): 448-455. DOI: 10.11996/JG.j.2095-302X.2023030448

Abstract ( 403 )

HTML ( 12 )

PDF (9050KB) ( 176 )

An efficient and accurate edge instrument detection equipment is crucial for building intelligent substations. However, due to the complex environment of substation, it is challenging for mobile side devices to quickly and accurately detect small targets, multi-categories, and highly similar instrument targets. To address this, a power instrument target detection method based on lightweight SS-YOLOv5 network was proposed. The algorithm was built on YOLOv5 and utilized the lightweight network ShuffleNet V2 to improve the model’s network structure. Deep separable convolutions were introduced to extract instrument features, reducing the computational complexity of the model during neck fusion and improving detection speed. Additionally, combined with Swin Transformer, modeling was undertaken through shift window, allowing for global and local information interaction and enhancing feature extraction ability. Finally, the image data set of substation instrument was constructed independently to train, test, and verify the model. The experimental results demonstrated that, compared with the YOLOv5 algorithm, the model parameters were reduced by 91.8% and the target detection speed was increased by 43.8% for the detection of pressure gauge, ammeter, voltmeter, and other instrument images. It provided a feasible technical scheme for deployment to the side, and could accelerate the development of substation informatization and intelligence.

Figures and Tables | References | Related Articles | Metrics

YOLO-RD-Apple orchard heterogenous image obscured fruit detection model

HAO Peng-fei, LIU Li-qun, GU Ren-yuan

2023, 44(3): 456-464. DOI: 10.11996/JG.j.2095-302X.2023030456

Abstract ( 385 )

HTML ( 14 )

PDF (11207KB) ( 108 )

In order to address the challenge of robotic automated picking of highly occluded fruits in natural apple orchard environments, a YOLO-RD-Apple orchard heterogenous image occlusion fruit detection model based on dual inputs of RGB and Depth images was proposed. To reduce computational effort while ensuring the feature extraction capability, the lightweight MobileNetV2 and the lighter MobileNetV2-Lite, which was designed on the basis of MobileNetV2, were utilized as feature extractors for RGB and Depth images, respectively. Combining CSPNet with depth-separable convolution to accompany the SE attention module, the new SE-DWCSP3 module was proposed to improve the PANet structure and enhance the feature extraction capability of the network for stubby apple targets. Furthermore, the Soft NMS algorithm was introduced to replace the general NMS algorithm to address the false suppression phenomenon of the algorithm for dense targets and reduce the missed detection rate of obscured apples. The experimental results demonstrated the efficacy of this model on a natural obscured apple dataset, with an AP value of 93.1% on the test set, surpassing YOLOv4 by 1.4 percentage points, a 70% reduction in the number of parameters compared to YOLOv4, and a detection speed of 40.5 FPS on GPU (V100), which is 12.5% higher than that of YOLOv4. The proposed model exhibited improved detection accuracy and speed compared with YOLOv4, while simultaneously reducing the number of network parameters, making it more applicable to actual orchard apple picking scenarios.

Figures and Tables | References | Related Articles | Metrics

On-site monitoring technology of illegal swimming and fishing based on YoloX-ECA

LUO Wen-yu, FU Ming-yue

2023, 44(3): 465-472. DOI: 10.11996/JG.j.2095-302X.2023030465

Abstract ( 285 )

HTML ( 15 )

PDF (3564KB) ( 97 )

Every year in China, a large number of people die from drowning due to illegal swimming and fishing in reservoirs, rivers, and lakes. These bodies of water are often located in remote areas, making it difficult for staff to supervise them 24 hours a day. Existing target detection methods are either too slow or inaccurate, too large to deploy, or incapable of detecting illegal swimming and fishing in real-time. To address these shortcomings, we proposed the YoloX-ECA model with an attention model. By adding the efficient channel attention (ECA) block to the CSPLayer and FPN, we aimed to improve detection performance for swimming and fishing while maintaining detection speed. Experimental results on self-made datasets showed that the YoloX-ECA achieved over 90% AP for the detection of swimming and fishing classes, with a detection speed of 62.29 fps. Compared with YoloX, mAP was increased by 1.21%. Furthermore, YoloX-ECA’s performance also outperformed other target detection algorithms such as Faster-RCNN. The improved YoloX-ECA model achieved the expected design goals and displayed great prospects for application in the field of intelligent supervision of rivers and lakes.

Figures and Tables | References | Related Articles | Metrics

Natural scene text detection based on attention mechanism and deep multi-scale feature fusion

LI Yu, YAN Tian-tian, ZHOU Dong-sheng, WEI Xiao-peng

2023, 44(3): 473-481. DOI: 10.11996/JG.j.2095-302X.2023030473

Abstract ( 354 )

HTML ( 24 )

PDF (3415KB) ( 148 )

A scene text detection method based on attention mechanism and deep multi-scale feature fusion was proposed to address the issue that existing scene text detection methods cannot deeply mine and fully fuse discriminative multi-scale text instance features. The ResNeSt50 network with attention enhancement served as the backbone network to extract more discriminative feature representation related to text instance across different scales. Furthermore, a deep multi-scale feature fusion module was designed to interact with feature information related to feature maps of different scales. This module adaptively learned the corresponding weight matrix related to feature maps of different scales, which were used to further mine and fuse discriminative feature information about text instances on feature maps of different scales, thus yielding a robust multi-scale fusion feature map. Finally, an adaptive binarization post-processing module was adopted to generate a more accurate text area bounding box. To evaluate the effectiveness of the proposed method, extensive experiments were conducted on ICDAR2015, ICDAR2013, and CTW1500 datasets. The results demonstrated that the proposed method achieved competitive detection results compared with other advanced detection methods and presented excellent robustness and generalization ability.

Figures and Tables | References | Related Articles | Metrics

Segmentation of laser coding characters based on residual and feature-grouped attention

XIAO Tian-xing, WU Jing-jing

2023, 44(3): 482-491. DOI: 10.11996/JG.j.2095-302X.2023030482

Abstract ( 174 )

HTML ( 2 )

PDF (2334KB) ( 62 )

Laser coding on metal surface can lead to denaturation of surrounding metal and generate a significant amount of noise in the form of burns. This results in complex backgrounds in the character region, low contrast, and ambiguity of characters, which can make subsequent character recognition challenging. In response, Res18-UNet, a novel laser coding character feature enhancement and fine segmentation model, was proposed. The proposed model was based on residual and feature-grouped attention to highlight character information and improve signal-to-noise ratio, thus effectively segmenting the target. Firstly, the A-R unit was designed to reduce network parameters, effectively avoid network degradation, and improve the feature selection ability in channels and spaces. Secondly, the feature-grouped attention mechanism was proposed, and the improved spatial attention was added to enhance weak character features. In addition, a deep supervision module integrating the improved loss function was designed in the upsampling stage to improve network convergence and enhance segmentation precision. According to the experiment on the image dataset of the can bottoms with laser coding, the proposed model outperformed the original UNet model in terms of mIoU, Dice coefficient, and F1 score. Specifically, the proposed model achieved 0.801 0, 0.889 5, and 0.903 5, respectively, and attained the prediction speed 2.6 times that of the original UNet at 12.24 images/s. Experiments have proven that this algorithm can effectively enhance the features of low contrast laser coded characters and segment them with high precision, and that it has the feasibility and application prospect of deployment and operation on embedded platforms.

Figures and Tables | References | Related Articles | Metrics

Graph element detection matching based on Republic of China banknotes

WANG Jia-jing, WANG Chen, ZHU Yuan-yuan, WANG Xiao-mei

2023, 44(3): 492-501. DOI: 10.11996/JG.j.2095-302X.2023030492

Abstract ( 154 )

HTML ( 4 )

PDF (4493KB) ( 45 )

In view of the fact that there are numerous types of Republic of China banknotes, which often have slight visual differences between different banknote, combined with the issues of mold, burrs or breakage after circulation, the recognition and classification ability of traditional fine-grained image retrieval methods for Republican banknotes is inadequate. To address these issues, this paper proposed a fine-grained retrieval model of Republican banknotes based on multiscale feature fusion. To reduce the time of manual data labeling, YOLOv4 was employed for graph element detection on banknote images, with the main view of banknotes being adopted as the input feature map. EfficientNet-B0 was utilized as the backbone network for retrieval, thereby reducing the burden of redundant information in the network and enhancing network accuracy. In the model, the feature vectors of layers 2, 4, 10, and 15 of the PANet fusion network were utilized to generate a global feature vector library, improving the banknote matching retrieval capability. Furthermore, the feature vectors were clustered using adaptive K-means to simplify the matching time and computation. The experimental results demonstrated that the proposed model achieved an accuracy of 89.6%, improving the retrieval accuracy by 10 percentage points compared to using the original image of banknotes as the input image. The improved model exhibited better classification performance, less inference time cost, and fine classification of banknotes. These results could meet the practical requirements of industry.

Figures and Tables | References | Related Articles | Metrics

Research on cyclic generative network oriented to inter-layer interpolation of medical images

SUN Long-fei, LIU Hui, YANG Feng-chang, LI Pan

2023, 44(3): 502-512. DOI: 10.11996/JG.j.2095-302X.2023030502

Abstract ( 210 )

HTML ( 6 )

PDF (5106KB) ( 65 )

Due to the limitations of imaging equipment performance and radiation dose, the inter-layer resolution of CT and MRI image sequences is considerably lower than the intra-layer resolution, which greatly impedes the application of medical images. Consequently, the problem of effectively improving the inter-layer resolution of medical image has become a critical concern. To address this problem, a cyclic generative network was proposed for inter-layer interpolation of medical images. The network mapped medical images into their corresponding binary images to achieve simple and fluent inter-layer interpolation processing for consecutive medical image sequences. The proposed network was composed of two modules. On the one hand, the image transformation module was designed with a generator sub-network containing 9 residual blocks and 2 bilinear upsampling modules to achieve effective image transformation. Then, the cyclic mapping between medical images and their corresponding binary images was achieved by the bidirectional nonlinear mapping capability learned by the module. On the other hand, the interpolation module combined motion estimation and image generation into a single convolution step and constructed a binary image Charbonnier difference loss function suitable for medical image features to further improve image resolution and perform interpolation processing for the binary image sequences. Experimental results on five multi-type datasets demonstrated that the proposed method surpassed advanced comparison methods in terms of the average signal-to-noise ratio (PSNR) and structural similarity index metric (SSIM) of the generated images, and was also better at handling detailed information such as image edges and contours.

Figures and Tables | References | Related Articles | Metrics

Fine-grained classification model of lung disease for imbalanced data

LIU Bing, YE Cheng-xu

2023, 44(3): 513-520. DOI: 10.11996/JG.j.2095-302X.2023030513

Abstract ( 221 )

HTML ( 7 )

PDF (1847KB) ( 78 )

There are various types of lung diseases, each having distinct imaging manifestations. However, medical image data related to these diseases often suffer from category imbalance, making it difficult to distinguish them using general deep learning models. To tackle these problems, a fine-grained classification model for lung diseases based on imbalanced data was proposed. The model had a two-branch feature extraction structure, namely EfficientNetB0 and MobileNetV2 with a convolutional block attention module (CBAM). The attention mechanism was utilized to enhance the weight of important features in the images. After feature extraction, the features were fused based on multi-mode bilinear pooling, and the Focal Loss function was used to improve the classification effect of imbalanced data. The model was trained using the strategy of hyperparameter adaptive adjustment, and finally the classification was completed. Grad-CAM was also employed to visualize the concerns of the model to address the interpretability of the classification. The experimental results demonstrated that the proposed model achieved a classification accuracy of 0.985, a Kappa coefficient of 0.973, and an F1 value of 0.981. All the evaluation indexes have been significantly improved, which has exhibited excellent classification performance and can act as a helpful tool for the auxiliary diagnosis of lung diseases.

Figures and Tables | References | Related Articles | Metrics

Bi-directionally aligned VAE based on double semantics for generalized zero-shot learning

SHI Cai-juan, SHI Ze, YAN Jin-wei, BI Yang-yang

2023, 44(3): 521-530. DOI: 10.11996/JG.j.2095-302X.2023030521

Abstract ( 241 )

HTML ( 13 )

PDF (1127KB) ( 77 )

Generalized zero-shot learning (GZSL) aims to recognize both seen and unseen classes by utilizing the relationship between visual features and semantic information. However, existing GZSL methods mostly rely on generative models to generate pseudo visual features for unseen classes. The problem with these models is that they commonly employ unidirectional VAE and a single type of semantic prototype, which limits the obtained semantic information of unseen classes. To address this issue, a bi-directionally aligned VAE based on a double semantics model (BAVAE-DS) for GZSL was proposed. First, two types of prototypes, i.e., user-defined attributes and word vectors, were adopted to steadily generate two types of pseudo visual features respectively using the bi-directionally aligned VAE. This resulted in abundant semantic information that could be used to represent unseen classes. Next, a feature fusion model was designed to fuse the two types of pseudo visual features and remove the redundancy, thus enhancing the pseudo visual features. Finally, classification regularization was employed to enhance the independence of classes in the classification module. Extensive experiments were conducted on three benchmark datasets and the results were compared with other methods, proving the effectiveness of the proposed model.

Figures and Tables | References | Related Articles | Metrics

Semantic segmentation with fusion of spatial criss-cross and channel multi-head attention

WU Wen-huan, ZHANG Hao-kun

2023, 44(3): 531-539. DOI: 10.11996/JG.j.2095-302X.2023030531

Abstract ( 286 )

HTML ( 10 )

PDF (3282KB) ( 120 )

In light of the shortcomings of current semantic segmentation methods, which suffer from ineffective construction of contextual semantic associations and insufficient representation of extracted semantic features, a novel semantic segmentation network that combines spatial criss-cross attention and channel attention was proposed. Firstly, the spatial criss-cross attention module (SCCAM) was adopted to aggregate context information of each target pixel in the horizontal and vertical directions, thus enabling efficient construction of non-local semantic dependencies between pixels. Secondly, the multi-head attention mechanism was introduced in the channel attention module (CAM) to mine channel features with more significant semantics on multiple channel subspaces. Finally, the semantic representation capability was strengthened by merging attention features on both spatial and channel dimensions, thereby improving the precision of semantic segmentation. The experimental results on several datasets, including Cityscapes, PASCAL VOC2012, and CamVid demonstrated that the proposed network model outperformed other state-of-the-art semantic segmentation methods in terms of segmentation accuracy.

Figures and Tables | References | Related Articles | Metrics

Optimization of 3D reconstruction method for urban cable tunnel based on SfM method

GE Hai-ming, ZHANG Wei, WANG Xiao-long, ZHU Jing-jing, JIA Fei, XUE Ya-dong

2023, 44(3): 540-550. DOI: 10.11996/JG.j.2095-302X.2023030540

Abstract ( 324 )

HTML ( 15 )

PDF (13629KB) ( 132 )

The three-dimensional reconstruction technology is extensively applied to various scenarios such as urban subway tunnels and railway tunnels to visualize tunnel structure and analyze tunnel properties. For urban cable tunnels, due to many internal obstacles with smaller sections than subway tunnels, there are some difficulties in using the traditional three-dimensional reconstruction methods such as 3D laser scanners. Photogrammetry technology, with its portable equipment, holds significant potential in the three-dimensional reconstruction of cable tunnels. Based on photogrammetry theory, this paper summarized and analyzed various factors affecting the effect of 3D reconstruction of cable tunnels and established a model evaluation method for 3D reconstruction of cable tunnel scenes. Experiments were carried out in a gas insulated transmission line (GIL) cable tunnel in Nantong, Jiangsu Province. To mitigate the problem of uneven brightness and darkness in the original image, histogram equalization and gamma transform were employed for enhanced preprocessing, thereby restoring the lining details of the dark part of the tunnel image. Based on the structure from motion (SfM) 3D reconstruction method, a 3D model of the cable tunnel with complete internal texture information was successfully constructed. Taking the model evaluation method as the judgment standard, the key factors such as image acquisition method and modeling parameters were optimized. The results revealed that employing 200 images and moderate quality modeling can ensure high point cloud density (1.0×10⁵ m), a reasonable number of central feature points (321), a low RMS reprojection error (1.36 pix), and a high modeling efficiency (177 s), meeting the needs of tunnel inspection.

Figures and Tables | References | Related Articles | Metrics

A generative network based on non-local information for atmospheric polarization modelling

YAN Yuan, GAO Xin-jian, GAO Jun, WANG Xin, CHENG Qian

2023, 44(3): 551-559. DOI: 10.11996/JG.j.2095-302X.2023030551

Abstract ( 141 )

HTML ( 3 )

PDF (4085KB) ( 45 )

As a stable natural attribute, the atmospheric polarization mode is widely used in various fields such as navigation and detection because it contains the ∞ shape feature and meridian feature with directional information. However, obtaining atmospheric polarization information in real weather conditions is a challenging task due to the limitations imposed by dynamic cloud interference. This limitation causes the distribution law of atmospheric polarization information to be destroyed and, in turn, leads to the loss of some information. To address this issue, we proposed an atmospheric polarization mode generation method based on non-local information. Then, a non-local information inpainting block was designed for two-stage repair. In the first stage, the non-local spatial continuity information of the atmospheric polarization mode was mined to enhance the global structure between feature information and realize spatial information repair. In the second stage, the feature mapping relationship was established between atmospheric polarization information at different times, and the time continuity of the non-local atmospheric polarization information distribution was employed to repair feature information of superimposed noise regions in the time dimension. The experimental results on the Temporal Polarization 1072 polarization dataset qualitatively and quantitatively demonstrated the efficacy of this method in effectively removing cloud interference noise in the atmospheric polarization mode and repairing polarization information of missing areas, and higher structural and semantic consistency of the generated results.

Figures and Tables | References | Related Articles | Metrics

A 3D point cloud defense framework combined with adversarial examples detection and reconstruction

ZHAO Yu-kun, REN Shuang, ZHANG Xin-yun

2023, 44(3): 560-569. DOI: 10.11996/JG.j.2095-302X.2023030560

Abstract ( 206 )

HTML ( 4 )

PDF (1124KB) ( 57 )

The development of 3D point cloud deep neural networks has enabled their application in many high-security tasks. However, adversarial examples could easily lead the normally trained deep learning models to make incorrect predictions, making it essential to improve the robustness of input data to deep neural networks. The existing 3D point cloud defense networks are inefficient and fail to recover the surface deformation of the point cloud and point distribution adequately. To address these issues, a 3D point cloud adversarial defense network framework combining adversarial example detection and reconstruction was proposed. The input sample was first detected by an error-based detector before and after reconstruction. If it was an adversarial example, it was then reconstructed by a variational autoencoder-based reformer before being fed into the classification network. The variational autoencoder’s structure enhanced the learning of numerical voids on the hidden space, and the same number of points before and after reconstruction ensured efficient subsequent networks and better recovery of the point cloud shape. For the experiments, a variety of classical classification models were attacked on the ModelNet40 dataset, and the effectiveness of the detector-reformer defense framework against these attacks was tested. The experiments demonstrated that the defense method outperformed all other defense methods in terms of classification accuracy on PointNet and especially performed well in the attack based on the saliency map and the adversarial generation network. The detector-reformer defense network framework could improve the accuracy from 47.65% to 75.02% on the dropping attacks with 200 points lost. The effectiveness of the detector and reformer on the overall classification accuracy was demonstrated by ablation experiments and visual reconstruction results.

Figures and Tables | References | Related Articles | Metrics

Regional hierarchical mesh simplification algorithm for feature retention

ZHU Tian-xiao, YAN Feng-ting, SHI Zhi-cai

2023, 44(3): 570-578. DOI: 10.11996/JG.j.2095-302X.2023030570

Abstract ( 279 )

HTML ( 10 )

PDF (8784KB) ( 132 )

As the accuracy of 3D modeling continues to improve, the data size of mesh models is increasing proportionally. Thus, simplifying mesh models is essential to facilitate storage and computation. However, most mesh simplification algorithms usually set a single simplification rate for the entire model, and they are unable to retain local features through different levels of simplification. To address this limitation, a hierarchical mesh simplification algorithm called regional hierarchical quadric error metric algorithm (RH-QEM) for local feature retention was proposed. First, the algorithm segmented the mesh model using spectral clustering and constructed the kernel function using geodesic and cosine distances. Then a curvature metric based on normal vectors was constructed to measure the curvature degree of different localities of the mesh model, according to which the graded grid simplification rate was set. Different regions were mapped to different simplification rates. Finally, the algorithm constructed an improved edge folding cost function to achieve graded simplification for different regions of the grid model. Experiments were conducted on CAD models and scanned models. The experimental results demonstrated that the RH-QEM algorithm outperformed three compared algorithms, as it could reduce the simplification errors and enhance the mesh quality, thus realizing graded simplification and effectively maintaining the detailed features of the model.

Figures and Tables | References | Related Articles | Metrics

Preliminary study of density modeling method

SHEN Wan-qiang

2023, 44(3): 579-587. DOI: 10.11996/JG.j.2095-302X.2023030579

Abstract ( 132 )

HTML ( 2 )

PDF (1156KB) ( 52 )

The traditional free curve modeling system can be described as “a (discrete) weighted average of a (discrete) sequence of control points with respect to a (discrete) sequence of basis functions.” This discrete property has been transformed into a continuous property, which could be described as “a (continuous) integral average of a (continuous) curve with respect to a (continuous) function family.” The corresponding change was similar to the transformation from the mathematical expectation of a discrete random variable defined by probability distribution law to the mathematical expectation of a continuous random variable defined by probability density functions in probability theory. Hence, the modeling method with continuous property was referred to as the density modeling method, where the continuous curve was known as a control curve, and the continuous function family was referred to as a basis density function. To preliminarily explore the density modeling method, we presented its model, constructed a basic density function of degree 1 and 2 satisfying non-negativity, normalization, and symmetry properties, and examined the derivatives of the basic density functions and the moment functions of the corresponding random variable. During density modeling, an arbitrary polynomial or even non-polynomial parametric curves could be used as the input, and the output curve was a polynomial curve of degree 1 or 2, respectively. The density modeling curve possesses properties such as convex hull, affine invariance, and symmetry properties.

Figures and Tables | References | Related Articles | Metrics

Study on multi-scale visual analysis method of activated corrosion products in fusion reactor

LUO Yue-tong, YANG Meng-nan, PENG Jun, ZHOU Bo, ZHANG Yan-kong

2023, 44(3): 588-598. DOI: 10.11996/JG.j.2095-302X.2023030588

Abstract ( 168 )

HTML ( 2 )

PDF (2087KB) ( 53 )

Fusion energy is regarded as the ultimate clean energy for human use, and safety is paramount for the advancement of fusion energy. Experts in this field must ensure the safety of the entire process from production to usage. Currently, the problem of activated corrosion products in the cooling pipes of fusion reactors poses a significant challenge to safety. These products are the primary sources of radioactivity in fusion reactors and cause safety problems. They have a critical impact on the shielding design, personnel protection, and accident consequences of fusion reactors. Moreover, the activated corrosion products are distributed throughout the cooling pipe and accumulate over time. Therefore, studying their distribution characteristics in cooling pipes is essential for ensuring fusion safety. At this stage, experts in the field lack a method that can quickly and efficiently analyze the distribution characteristics of the activated corrosion products. During the research process, various parts of the cooling pipe of the fusion reactor were classified according to different characteristics, and a large number of components were obtained. Therefore, the activated corrosion products are widely distributed in the components and are composed of various radioactive substances such as Co57, Co60, and Mn54. The composition and distribution of the activated corrosion products evolve over time; therefore, the activated corrosion product data are multivariate time-series data. Considering the characteristics of the activated corrosion product data and the analytical needs of domain experts, a visual analysis system was designed and developed. Its main features include the following: ① To address the problem of too many components of the cooling pipe, it proposed a multi-level clustering analysis approach to divide the components and help domain experts quickly select representative components for in-depth analysis, thereby avoiding checking all components one by one. ② The system addressed the challenge of a large time span by automatically extracting potential critical time periods using the multi-granularity division method, thus enabling domain experts to focus on critical time periods. ③ A set of multi view linkage visual analysis systems was designed to assist domain experts in quickly obtaining the overall distribution characteristics of activated corrosion products through flexible interaction. Based on the above characteristics, a Web-based system that can conduct multi-scale and multi-granularity analyses of fusion reactor activated corrosion products was designed. The International Thermonuclear Experimental Reactor (ITER) is currently the largest international cooperation project in the world. The effectiveness of the method was tested using the activated corrosion product data of the ITER reactor from 2013 to 2020. Domain experts pointed out that the tool is useful in selecting representative components and key time periods and can greatly improve their work efficiency.

Figures and Tables | References | Related Articles | Metrics

Exploration on the modeling method of complex dynamic system integrating multi-agent and hypergraph

WANG Peng-fei, TAO Ti-wei, JIAO Dian, SHEN Yan-ming, ZHOU Dong-sheng, ZHANG Qiang

2023, 44(3): 599-608. DOI: 10.11996/JG.j.2095-302X.2023030599

Abstract ( 416 )

HTML ( 7 )

PDF (1617KB) ( 121 )

Most systems can be abstracted as complex systems in nature and human society. Given the increasing complexity of today’s complex systems, there is an urgent need for advanced and mature complex system theories and methods for modeling research and processing. However, the current graph-based modeling methods used in complex systems encounter difficulties in depicting the extremely complex connections between nodes and the higher-order relationships between them. Additionally, these methods face challenges in effectively portraying the intelligent perception, decision-making, and control of complex systems. A modeling method for complex dynamic systems integrating multi-agents and hypergraphs was proposed to address these issues. This model dynamically evolved from several different evolutionary angles to specifically describe complex dynamic systems. This model enabled perception, decision-making, and control by conferring agent nodes with intelligent features in complex systems and better describing the higher-order relationships between agent nodes. As a result, this modeling approach provided novel ideas and methods for the study of the intelligence theory of complex systems.

Figures and Tables | References | Related Articles | Metrics

Design of digital twin system for forging hydraulic press

TANG Peng, SA Guo-dong, LIU Zhen-yu, TAN Jian-rong

2023, 44(3): 609-615. DOI: 10.11996/JG.j.2095-302X.2023030609

Abstract ( 419 )

HTML ( 11 )

PDF (2836KB) ( 148 )

The forging hydraulic press is an essential piece of equipment for achieving rapid forging of super large parts. To enhance the forging performance and reliability of hydraulic press forging, this paper focused on digital twin-driven forging hydraulic press working state perception and virtual reality synchronization. By analyzing the real-time perception demand of the hydraulic forging press’s working state and early warning of forging risks, a digital twin framework for forging hydraulic press was constructed, including a physical hardware layer, perception control layer, twin model layer, and visual interaction layer. The digital twin system of the forging hydraulic press was driven online by sensor data and simulation data, enabling early warning of forging risk through real-time simulation. This system visualized the forging process of the forging hydraulic press, improved the reliability of the working process of the forging press, and provided key technical support for the realization of digital twin in the production process of forging hydraulic presses.

Figures and Tables | References | Related Articles | Metrics

Research on semi-automatic geo-referencing approach in city information modeling

WANG Heng-wei, HU Zhen-zhong, ZHAO Yan-lai, ZHONG Jian, CHEN Yi

2023, 44(3): 616-624. DOI: 10.11996/JG.j.2095-302X.2023030616

Abstract ( 212 )

HTML ( 3 )

PDF (8180KB) ( 94 )

When creating a city information model (CIM), it is often necessary to geo-reference a 3D model that lacks spatial reference in a geographic information system (GIS). Based on a geometric reference point, semi-automatic geo-referencing could be achieved while ensuing accuracy. After the 3D model was imported into the GIS, the local coordinate system (LCS) was transformed to the surface of the geodetic ellipsoid, based on any reference point near the model. Afterwards, the geographic reference point on the 3D model was picked up in the GIS, and the LCS coordinates were calculated according to the current transformation matrix. Combined with the geodetic coordinates of the reference point, the correct transformation matrix of the model could be obtained. If there was no geographic reference point, artificial geo-referencing could be achieved by continuously estimating the geographic reference point based on this approach. The semi-automatic georeferencing approach was realized using Cesium and applied using 3D models from actual projects, demonstrating its feasibility. The above approaches were reliant on the spatial transformation algorithms from the LCS to ECEF (earth-centered earth-fixed) coordinate system using a single reference point. The algorithms were categorized into four types: basic approach, reference point projection approach, origin projection approach, and approximate origin approach, each with distinct features and applicable scenarios. The latter 3 algorithms were applied in the verification process, and based on the calculated results, the influence of the distance between the reference point and the LCS origin on these algorithms was analyzed.

Figures and Tables | References | Related Articles | Metrics

Published as 3, 2023

2023, 44(3): 624.

Abstract ( 43 )

PDF (130118KB) ( 25 )

Related Articles | Metrics

22 references in Issue 3, 2023

2023, 44(3): 625.

Abstract ( 65 )

PDF (147KB) ( 18 )

Related Articles | Metrics