Journal of Graphics, 2025, Vol. 46, Issue (1): 70-80. DOI: 10.11996/JG.j.2095-302X.2025010070
• Image Processing and Computer Vision •
WANG Yang1, MA Chang1, HU Ming1, SUN Tao2, RAO Yuan3, YUAN Zhenyu1
Received: 2024-07-25
Accepted: 2024-10-14
Online: 2025-02-28
Published: 2025-02-14
About author:
First author contact: WANG Yang (1971-), professor, Ph.D. His main research interests include artificial intelligence, augmented reality, and ecological informatics. E-mail: wycap@126.com
WANG Yang, MA Chang, HU Ming, SUN Tao, RAO Yuan, YUAN Zhenyu. Lightweight wild bat detection method based on multi-scale feature fusion[J]. Journal of Graphics, 2025, 46(1): 70-80.
URL: http://www.txxb.com.cn/EN/10.11996/JG.j.2095-302X.2025010070
Fig. 5 Examples of bat images in the dataset ((a) A colony of bats roosting in a cave; (b) A bat in flight; (c) A bat observed from the side; (d) A bat observed from the front; (e) A bat roosting on a rock wall; (f) A bat emitting sound waves; (g) A bat observed from the back; (h) A group of bats at night)
Table 1 Information on the bat dataset
Category | Number of images |
---|---|
Great roundleaf bat (Hipposideros armiger) | 614 |
Pratt's roundleaf bat (Hipposideros pratti) | 496 |
Brown bat (Myoti) | 662 |
Horseshoe bat (Rousettus leschenaultia) | 656 |
Total | 2428 |
Table 2 Experimental parameter settings
Parameter | Value |
---|---|
Epochs | 100 |
Batch size | 4 |
Workers | 4 |
Optimizer | AdamW |
Initial learning rate (lr0) | 0.0001 |
Momentum | 0.9 |
Weight decay | 0.0001 |
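The settings in Table 2 can be captured as a small configuration object. The sketch below is illustrative only: the source does not name the training framework, so `TrainConfig` and `adamw_kwargs` are hypothetical names, and treating the momentum factor as AdamW's first beta coefficient (with a second coefficient of 0.999) is an assumption rather than something the table states.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TrainConfig:
    """Hyperparameters copied from Table 2 (names are illustrative)."""
    epochs: int = 100
    batch_size: int = 4
    workers: int = 4
    optimizer: str = "AdamW"
    lr0: float = 1e-4          # initial learning rate
    momentum: float = 0.9
    weight_decay: float = 1e-4

    def adamw_kwargs(self) -> dict:
        # Assumption: for an Adam-family optimizer, "momentum" plays the role
        # of beta1. beta2 = 0.999 is a common default, not given in the source.
        return {
            "lr": self.lr0,
            "betas": (self.momentum, 0.999),
            "weight_decay": self.weight_decay,
        }


cfg = TrainConfig()
print(cfg.adamw_kwargs())
```

In PyTorch, a dictionary of this shape could be passed as `torch.optim.AdamW(model.parameters(), **cfg.adamw_kwargs())`; whether the authors configured the optimizer this way is not confirmed by the table.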
Table 3 Comparison of detection metrics between RT-DETR and LiteDETR-Bat on the bat dataset
Method | Category | AP/% | P/% | R/% | mAP/% |
---|---|---|---|---|---|
RT-DETR | Great roundleaf bat (Hipposideros armiger) | 93.8 | 90.5 | 92.4 | 95.1 |
 | Pratt's roundleaf bat (Hipposideros pratti) | 94.3 | | | |
 | Brown bat (Myoti) | 93.1 | | | |
 | Horseshoe bat (Rousettus leschenaultia) | 99.3 | | | |
LiteDETR-Bat | Great roundleaf bat (Hipposideros armiger) | 96.4 | 96.0 | 93.0 | 97.2 |
 | Pratt's roundleaf bat (Hipposideros pratti) | 97.9 | | | |
 | Brown bat (Myoti) | 94.9 | | | |
 | Horseshoe bat (Rousettus leschenaultia) | 99.4 | | | |
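The mAP column in Table 3 is consistent with the unweighted mean of the four per-class AP values. The following check is a minimal sketch using only the numbers in the table; the function name `mean_ap` and the dictionary layout are illustrative, and the source does not state the IoU threshold behind these AP values.

```python
from statistics import fmean

# Per-class AP (%) copied from Table 3, keyed by the Latin names as printed.
ap_per_class = {
    "RT-DETR": {
        "Hipposideros armiger": 93.8,
        "Hipposideros pratti": 94.3,
        "Myoti": 93.1,
        "Rousettus leschenaultia": 99.3,
    },
    "LiteDETR-Bat": {
        "Hipposideros armiger": 96.4,
        "Hipposideros pratti": 97.9,
        "Myoti": 94.9,
        "Rousettus leschenaultia": 99.4,
    },
}
reported_map = {"RT-DETR": 95.1, "LiteDETR-Bat": 97.2}


def mean_ap(ap_by_class: dict) -> float:
    """Unweighted mean of per-class AP values."""
    return fmean(ap_by_class.values())


for method, aps in ap_per_class.items():
    print(f"{method}: computed {mean_ap(aps):.3f}, reported {reported_map[method]}")

# Per-class AP gain of LiteDETR-Bat over RT-DETR, in percentage points.
gains = {
    cls: round(ap_per_class["LiteDETR-Bat"][cls] - ap_per_class["RT-DETR"][cls], 1)
    for cls in ap_per_class["RT-DETR"]
}
print(gains)
```

The computed means agree with the reported values to within rounding of the per-class figures.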
Table 4 Impact of the improved modules on model performance
Method | RCELAN | DS-MFF | Para/M | GFLOPs | FPS/(frames per second) | mAP/% |
---|---|---|---|---|---|---|
RT-DETR | - | - | 18.95 | 57.0 | 99.4 | 95.1 |
RT-DETR | √ | - | 8.63 | 26.4 | 117.0 | 95.4 |
RT-DETR | - | √ | 19.26 | 61.5 | 63.4 | 95.5 |
Proposed algorithm | √ | √ | 10.12 | 35.6 | 191.3 | 97.2 |
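To read the ablation in Table 4 in relative terms, each configuration can be compared against the RT-DETR baseline row. This is a small arithmetic sketch over the table's own numbers; the dictionary keys and the helper `relative_change` are illustrative names, not part of the source.

```python
# Rows of Table 4: parameters (M), GFLOPs, FPS, mAP (%).
ablation = {
    "baseline": {"params_m": 18.95, "gflops": 57.0, "fps": 99.4, "map": 95.1},
    "rcelan": {"params_m": 8.63, "gflops": 26.4, "fps": 117.0, "map": 95.4},
    "ds_mff": {"params_m": 19.26, "gflops": 61.5, "fps": 63.4, "map": 95.5},
    "rcelan_ds_mff": {"params_m": 10.12, "gflops": 35.6, "fps": 191.3, "map": 97.2},
}


def relative_change(new: float, base: float) -> float:
    """Signed change of `new` relative to `base`, in percent."""
    return 100.0 * (new - base) / base


base = ablation["baseline"]
for name, row in ablation.items():
    if name == "baseline":
        continue
    print(
        f"{name}: "
        f"params {relative_change(row['params_m'], base['params_m']):+.1f}%, "
        f"GFLOPs {relative_change(row['gflops'], base['gflops']):+.1f}%, "
        f"FPS {relative_change(row['fps'], base['fps']):+.1f}%, "
        f"mAP {row['map'] - base['map']:+.1f} points"
    )
```

For the full model (both modules enabled), this works out to roughly 47% fewer parameters, about 38% fewer GFLOPs, and a little over 92% higher FPS than the baseline, alongside a 2.1-point mAP gain.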
Table 5 Performance comparison of different models on the bat dataset
Method | Backbone | Weight/M | mAP/% | Para/M | GFLOPs | FPS/(frames per second) |
---|---|---|---|---|---|---|
YOLOv5 | CSPDarknet53 | 40.3 | 97.2 | 19.91 | 48.3 | 158.1 |
YOLOv6 | EfficientRep | 31.3 | 95.1 | 15.54 | 44.0 | 164.2 |
YOLOv7 | CSPDarknet53 | 71.4 | 94.2 | 35.48 | 105.2 | 108.7 |
YOLOv8 | CSPDarknet53 | 21.5 | 95.3 | 10.61 | 28.4 | 176.8 |
YOLOv9 | - | 540.1 | 98.4 | 67.03 | 313.4 | 47.9 |
YOLOv10 | - | 39.5 | 97.0 | 19.47 | 98.0 | 129.0 |
RT-DETR | ResNet18 | 38.6 | 95.1 | 18.95 | 57.0 | 99.4 |
RT-DETR_RCELAN | RCELAN | 17.7 | 95.4 | 8.62 | 26.4 | 117.0 |
RT-DETR_DS-MFF | ResNet18 | 39.2 | 95.5 | 19.26 | 61.5 | 63.4 |
LiteDETR-Bat | RCELAN | 20.8 | 97.2 | 10.12 | 35.6 | 191.3 |
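One way to read the accuracy/speed trade-off in Table 5 is to look for models that no other model beats on both mAP and FPS at once (a Pareto front). The sketch below applies that idea to the table's numbers; this analysis is not taken from the source, and `Model`, `dominates`, and `pareto_front` are illustrative names.

```python
from typing import NamedTuple


class Model(NamedTuple):
    name: str
    map_pct: float   # mAP (%)
    params_m: float  # parameters (M)
    fps: float       # frames per second


# Values copied from Table 5.
MODELS = [
    Model("YOLOv5", 97.2, 19.91, 158.1),
    Model("YOLOv6", 95.1, 15.54, 164.2),
    Model("YOLOv7", 94.2, 35.48, 108.7),
    Model("YOLOv8", 95.3, 10.61, 176.8),
    Model("YOLOv9", 98.4, 67.03, 47.9),
    Model("YOLOv10", 97.0, 19.47, 129.0),
    Model("RT-DETR", 95.1, 18.95, 99.4),
    Model("RT-DETR_RCELAN", 95.4, 8.62, 117.0),
    Model("RT-DETR_DS-MFF", 95.5, 19.26, 63.4),
    Model("LiteDETR-Bat", 97.2, 10.12, 191.3),
]


def dominates(a: Model, b: Model) -> bool:
    """True if a is at least as good as b on mAP and FPS, and better on one."""
    at_least_as_good = a.map_pct >= b.map_pct and a.fps >= b.fps
    strictly_better = a.map_pct > b.map_pct or a.fps > b.fps
    return at_least_as_good and strictly_better


def pareto_front(models: list) -> list:
    """Models that are not dominated by any other model in the list."""
    return [m for m in models if not any(dominates(o, m) for o in models)]


front = pareto_front(MODELS)
print([m.name for m in front])

# Smallest model (by parameter count) among those reaching at least 97.0% mAP.
smallest_high_map = min(
    (m for m in MODELS if m.map_pct >= 97.0), key=lambda m: m.params_m
)
print(smallest_high_map.name)
```

On these figures only YOLOv9 (highest mAP) and LiteDETR-Bat (highest FPS) remain on the front, and LiteDETR-Bat is the smallest model among those with at least 97.0% mAP.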
Fig. 10 Visual comparison of detection results of different models ((a) Input image; (b) YOLOv5; (c) YOLOv7; (d) YOLOv9; (e) YOLOv10; (f) RT-DETR; (g) LiteDETR-Bat)
Table 6 Comparison of generalization performance on the CUB_200_2011 dataset
Method | P/% | R/% | mAP/% | Para/M | FPS/(frames per second) |
---|---|---|---|---|---|
RT-DETR | 71.7 | 65.3 | 66.4 | 19.20 | 78.8 |
Proposed algorithm | 74.8 | 67.0 | 70.0 | 11.14 | 104.9 |
[1] FAO. Global Forest Resources Assessment 2020-key findings[R]. Rome: FAO, 2020.
[2] HU D, LUO Z H, YE F Q, et al. Advances in bat carrier of important viruses[J]. Journal of Pathogen Biology, 2023, 18(1): 111-116 (in Chinese).
[3] PETSO T, JAMISOLA JR R S, MPOELENG D. Review on methods used for wildlife species and individual identification[J]. European Journal of Wildlife Research, 2022, 68(1): 3.
[4] ZHU Q J, HU B, WANG H L, et al. Detection of traffic signs based on lightweight YOLOv8s[J]. Journal of Graphics, 2024, 45(3): 422-432 (in Chinese).
[5] CHEN X, ZHAO J, CHEN Y H, et al. Automatic standardized processing and identification of tropical bat calls using deep learning approaches[J]. Biological Conservation, 2020, 241: 108269.
[6] KRIVEK G, GILLERT A, HARDER M, et al. BatNet: a deep learning-based tool for automated bat species identification from camera trap images[J]. Remote Sensing in Ecology and Conservation, 2023, 9(6): 759-774.
[7] XIE J J, ZHONG Y J, ZHANG J G, et al. A review of automatic recognition technology for bird vocalizations in the deep learning era[J]. Ecological Informatics, 2023, 73: 101927.
[8] PENG J B, WANG D L, LIAO X H, et al. Wild animal survey using UAS imagery and deep learning: modified Faster R-CNN for kiang detection in Tibetan Plateau[J]. ISPRS Journal of Photogrammetry and Remote Sensing, 2020, 169: 364-376.
[9] CHEN T H, ZHU J X, YIN J. Bird recognition algorithm based on attention mechanism[J]. Journal of Computer Applications, 2024, 44(4): 1114-1120 (in Chinese).
[10] YUAN C, ZHAO Y D, ZHANG Y, et al. Lightweight multi-modal pedestrian detection algorithm based on YOLO[J]. Journal of Graphics, 2024, 45(1): 35-46 (in Chinese).
[11] CARION N, MASSA F, SYNNAEVE G, et al. End-to-end object detection with transformers[C]// The 16th European Conference on Computer Vision. Cham: Springer, 2020: 213-229.
[12] HUANG Z H, LI H, XIONG X R, et al. End-to-end individual pig detection based on transfer learning[C]// The 6th International Conference on Pattern Recognition and Artificial Intelligence. New York: IEEE Press, 2023: 236-241.
[13] LUO H T, LUO X N, LI F, et al. Identification and detection of marine mammal dorsal fin based on deformable-DETR[C]// The 13th International Conference on Information Science and Technology. New York: IEEE Press, 2023: 349-357.
[14] LI G, ZHANG Y T, WANG W K, et al. Defect detection method of transmission line bolts based on DETR and prior knowledge fusion[J]. Journal of Graphics, 2023, 44(3): 438-447 (in Chinese).
[15] ZHAO Y, LV W Y, XU S L, et al. DETRs beat YOLOs on real-time object detection[C]// 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition. New York: IEEE Press, 2024: 16965-16974.
[16] HE K M, ZHANG X Y, REN S Q, et al. Deep residual learning for image recognition[C]// 2016 IEEE Conference on Computer Vision and Pattern Recognition. New York: IEEE Press, 2016: 770-778.
[17] LIU W Z, LU H, FU H T, et al. Learning to upsample by learning to sample[C]// 2023 IEEE/CVF International Conference on Computer Vision. New York: IEEE Press, 2023: 6027-6037.
[18] SHI W Z, CABALLERO J, HUSZÁR F, et al. Real-time single image and video super-resolution using an efficient sub-pixel convolutional neural network[C]// 2016 IEEE Conference on Computer Vision and Pattern Recognition. New York: IEEE Press, 2016: 1874-1883.
[19] Ultralytics. YOLOv5 in PyTorch[EB/OL]. [2024-06-01]. https://github.com/ultralytics/yolov5.
[20] LI C Y, LI L L, JIANG H L, et al. YOLOv6: a single-stage object detection framework for industrial applications[EB/OL]. (2022-09-07) [2024-05-26]. https://arxiv.org/abs/2209.02976.
[21] WANG C Y, BOCHKOVSKIY A, LIAO H Y M. YOLOv7: trainable bag-of-freebies sets new state-of-the-art for real-time object detectors[C]// 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition. New York: IEEE Press, 2023: 7464-7475.
[22] WANG C Y, YEH I H, LIAO H Y M. YOLOv9: learning what you want to learn using programmable gradient information[EB/OL]. (2024-02-29) [2024-05-26]. https://arxiv.org/abs/2402.13616.
[23] WANG A, CHEN H, LIU L H, et al. YOLOv10: real-time end-to-end object detection[EB/OL]. (2024-05-23) [2024-06-01]. https://arxiv.org/abs/2405.14458.
[24] WAH C, BRANSON S, WELINDER P, et al. The Caltech-UCSD Birds-200-2011 dataset[J]. California Institute of Technology, 2011, 32(1): 1-6.