Journal of Graphics ›› 2024, Vol. 45 ›› Issue (6): 1328-1337.DOI: 10.11996/JG.j.2095-302X.2024061328
• Image Processing and Computer Vision • Previous Articles Next Articles
Received:
2024-06-13
Accepted:
2024-09-26
Online:
2024-12-31
Published:
2024-12-24
About author:
First author contact:YAN Jianhong (1972-), professor, Ph.D. Her main research interests cover machine learning, computer vision. E-mail:yan_jian_hong@163.com
Supported by:
CLC Number:
YAN Jianhong, RAN Tongxiao. Lightweight UAV image target detection algorithm based on YOLOv8[J]. Journal of Graphics, 2024, 45(6): 1328-1337.
Add to citation manager EndNote|Ris|BibTeX
URL: http://www.txxb.com.cn/EN/10.11996/JG.j.2095-302X.2024061328
Fig. 8 Comparison of different detection networks ((a) YOLOv8s network; (b) Four-layer detection network; (c) Reconstructing the small target detection network)
参数名称 | 参数值 |
---|---|
epoch | 300 |
batch-size | 8 |
初始学习率 | 0.01 |
优化器 | SGD |
momentum | 0.937 |
图像分辨率 | 640×640 |
Table 1 Training parameters
参数名称 | 参数值 |
---|---|
epoch | 300 |
batch-size | 8 |
初始学习率 | 0.01 |
优化器 | SGD |
momentum | 0.937 |
图像分辨率 | 640×640 |
实验 | P/ % | R/ % | mAP/ % | Weight/ MB | FPS |
---|---|---|---|---|---|
YOLOv8s | 50.2 | 38.0 | 38.8 | 22.5 | 110 |
替换backbone | 49.9 | 38.6 | 39.4 | 18.7 | 238 |
替换neck | 50.3 | 37.9 | 39.3 | 18.8 | 161 |
全部替换 | 49.0 | 37.6 | 38.8 | 16.0 | 251 |
Table 2 FasterC2f ablation experiment
实验 | P/ % | R/ % | mAP/ % | Weight/ MB | FPS |
---|---|---|---|---|---|
YOLOv8s | 50.2 | 38.0 | 38.8 | 22.5 | 110 |
替换backbone | 49.9 | 38.6 | 39.4 | 18.7 | 238 |
替换neck | 50.3 | 37.9 | 39.3 | 18.8 | 161 |
全部替换 | 49.0 | 37.6 | 38.8 | 16.0 | 251 |
模型 | P/ % | R/ % | mAP/ % | Weight/ MB | FPS |
---|---|---|---|---|---|
YOLOv8s | 50.2 | 38.0 | 38.8 | 22.5 | 110 |
YOLOv8s+BiFPN | 50.0 | 39.9 | 40.7 | 56.7 | 149 |
YOLOv8s+AFPN | 49.6 | 36.1 | 37.7 | 17.2 | 207 |
YOLOv8s+FD-FPN | 51.5 | 38.2 | 39.9 | 21.1 | 176 |
Table 3 Comparison experiment of different feature pyramid networks
模型 | P/ % | R/ % | mAP/ % | Weight/ MB | FPS |
---|---|---|---|---|---|
YOLOv8s | 50.2 | 38.0 | 38.8 | 22.5 | 110 |
YOLOv8s+BiFPN | 50.0 | 39.9 | 40.7 | 56.7 | 149 |
YOLOv8s+AFPN | 49.6 | 36.1 | 37.7 | 17.2 | 207 |
YOLOv8s+FD-FPN | 51.5 | 38.2 | 39.9 | 21.1 | 176 |
模型 | P/ % | R/ % | mAP/ % | Weight/ MB | FPS |
---|---|---|---|---|---|
YOLOv8s | 50.2 | 38 | 38.8 | 22.5 | 110 |
YOLOv8s+ 小目标检测层 | 55.3 | 41.1 | 44.0 | 36.8 | 108 |
YOLOv8s+ 重构小目标检测层 | 55.1 | 42.0 | 42.5 | 7.4 | 101 |
Table 4 Comparison experiment of different detection networks
模型 | P/ % | R/ % | mAP/ % | Weight/ MB | FPS |
---|---|---|---|---|---|
YOLOv8s | 50.2 | 38 | 38.8 | 22.5 | 110 |
YOLOv8s+ 小目标检测层 | 55.3 | 41.1 | 44.0 | 36.8 | 108 |
YOLOv8s+ 重构小目标检测层 | 55.1 | 42.0 | 42.5 | 7.4 | 101 |
实验 | P/% | R/% | mAP/% | Para/M | Weight/MB | FPS |
---|---|---|---|---|---|---|
YOLOv8s | 50.2 | 38 | 38.8 | 11.17 | 22.5 | 110 |
+FasterC2f | 49.9 | 38.6 | 39.4 | 9.68 | 18.7 | 238 |
+FD-FPN | 51.5 | 38.2 | 39.3 | 10.97 | 21.1 | 176 |
+SCDH | 51.5 | 38.8 | 39.5 | 9.43 | 18.2 | 204 |
+重构小目标 | 55.1 | 42 | 44.3 | 3.53 | 7.4 | 101 |
+FasterC2f +重构小目标 | 52.9 | 41.7 | 43.1 | 3.03 | 6.1 | 155 |
+FD-FPN +SCDH | 52.0 | 39.0 | 40.1 | 9.17 | 17.7 | 185 |
+FasterC2f +重构小目标+SCDH | 52.1 | 41.9 | 43.4 | 4.17 | 5.2 | 150 |
Ours | 55.3 | 43.4 | 45.4 | 3.62 | 7.2 | 128 |
Table 5 Ablation experiment
实验 | P/% | R/% | mAP/% | Para/M | Weight/MB | FPS |
---|---|---|---|---|---|---|
YOLOv8s | 50.2 | 38 | 38.8 | 11.17 | 22.5 | 110 |
+FasterC2f | 49.9 | 38.6 | 39.4 | 9.68 | 18.7 | 238 |
+FD-FPN | 51.5 | 38.2 | 39.3 | 10.97 | 21.1 | 176 |
+SCDH | 51.5 | 38.8 | 39.5 | 9.43 | 18.2 | 204 |
+重构小目标 | 55.1 | 42 | 44.3 | 3.53 | 7.4 | 101 |
+FasterC2f +重构小目标 | 52.9 | 41.7 | 43.1 | 3.03 | 6.1 | 155 |
+FD-FPN +SCDH | 52.0 | 39.0 | 40.1 | 9.17 | 17.7 | 185 |
+FasterC2f +重构小目标+SCDH | 52.1 | 41.9 | 43.4 | 4.17 | 5.2 | 150 |
Ours | 55.3 | 43.4 | 45.4 | 3.62 | 7.2 | 128 |
模型 | P/% | R/% | mAP/% | Para/M |
---|---|---|---|---|
YOLOv5s | 44.9 | 35.1 | 33.7 | 7.13 |
YOLOv6s | 47.3 | 36.4 | 36.8 | 16.30 |
YOLOv7 | 46.0 | 37.9 | 34.5 | 6.04 |
YOLOv8s | 50.2 | 38.0 | 38.8 | 11.17 |
YOLOv8-FasterNet[ | 48.8 | 36.6 | 37.1 | 8.60 |
Bi-YOLO[ | 51.6 | 39.6 | 40.7 | 6.49 |
文献[ | 50.7 | 40.4 | 41.3 | 2.62 |
Ours | 55.3 | 43.4 | 45.4 | 3.62 |
Table 6 Algorithm comparison experiment
模型 | P/% | R/% | mAP/% | Para/M |
---|---|---|---|---|
YOLOv5s | 44.9 | 35.1 | 33.7 | 7.13 |
YOLOv6s | 47.3 | 36.4 | 36.8 | 16.30 |
YOLOv7 | 46.0 | 37.9 | 34.5 | 6.04 |
YOLOv8s | 50.2 | 38.0 | 38.8 | 11.17 |
YOLOv8-FasterNet[ | 48.8 | 36.6 | 37.1 | 8.60 |
Bi-YOLO[ | 51.6 | 39.6 | 40.7 | 6.49 |
文献[ | 50.7 | 40.4 | 41.3 | 2.62 |
Ours | 55.3 | 43.4 | 45.4 | 3.62 |
模型 | P/% | R/% | mAP/% | Weight/MB |
---|---|---|---|---|
YOLOv8s | 80.6 | 75.1 | 82.0 | 22.6 |
Ours | 82.2 | 77.7 | 83.4 | 7.7 |
Table 7 Comparison experiment on TT100K datasets
模型 | P/% | R/% | mAP/% | Weight/MB |
---|---|---|---|---|
YOLOv8s | 80.6 | 75.1 | 82.0 | 22.6 |
Ours | 82.2 | 77.7 | 83.4 | 7.7 |
[1] | BOUGUETTAYA A, ZARZOUR H, KECHIDA A, et al. Vehicle detection from UAV imagery with deep learning: a review[J]. IEEE Transactions on Neural Networks and Learning Systems, 2022, 33(11): 6047-6067. |
[2] |
李利霞, 王鑫, 王军, 等. 基于特征融合与注意力机制的无人机图像小目标检测算法[J]. 图学学报, 2023, 44(4): 658-666.
DOI |
LI L X, WANG X, WANG J, et al. Small object detection algorithm in UAV image based on feature fusion and attention mechanism[J]. Journal of Graphics, 2023, 44(4): 658-666. (in Chinese) | |
[3] | CHEN Y T, LI J, NIU Y F, et al. Small object detection networks based on classification-oriented super-resolution GAN for UAV aerial imagery[C]// 2019 Chinese Control And Decision Conference. New York: IEEE Press, 2019: 4610-4615. |
[4] | 潘晓英, 贾凝心, 穆元震, 等. 小目标检测研究综述[J]. 中国图象图形学报, 2023, 28(9): 2587-2615. |
PAN X Y, JIA N X, MU Y Z, et al. Survey of small object detection[J]. Journal of Image and Graphics, 2023, 28(9): 2587-2615. (in Chinese) | |
[5] | DONG J, OTA K, DONG M X. UAV-based real-time survivor detection system in post-disaster search and rescue operations[J]. IEEE Journal on Miniaturization for Air and Space Systems, 2021, 2(4): 209-219. |
[6] |
REN S Q, HE K M, GIRSHICK R, et al. Faster R-CNN: towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(6): 1137-1149.
DOI PMID |
[7] | LIU W, ANGUELOV D, ERHAN D, et al. SSD: single shot MultiBox detector[C]// Computer Vision-ECCV 2016: 14th European Conference. Cham: Springer, 2016: 21-37. |
[8] | REDMON J, DIVVALA S, GIRSHICK R, et al. You only look once: unified, real-time object detection[C]// 2016 IEEE Conference on Computer Vision and Pattern Recognition. New York: IEEE Press, 2016: 779-788. |
[9] | LIU W J, QIANG J, LI X X, et al. UAV image small object detection based on composite backbone network[J]. Mobile Information Systems, 2022, 2022(1): 7319529. |
[10] |
魏陈浩, 杨睿, 刘振丙, 等. 具有双层路由注意力的YOLOv8道路场景目标检测方法[J]. 图学学报, 2023, 44(6): 1104-1111.
DOI |
WEI C H, YANG R, LIU Z B, et al. YOLOv8 with bi-level routing attention for road scene object detection[J]. Journal of Graphics, 2023, 44(6): 1104-1111. (in Chinese) | |
[11] |
史涛, 崔杰, 李松. 优化改进YOLOv8实现实时无人机车辆检测的算法[J]. 计算机工程与应用, 2024, 60(9): 79-89.
DOI |
SHI T, CUI J, LI S. Algorithm for real-time vehicle detection from UAVs based on optimizing and improving YOLOv8[J]. Computer Engineering and Applications, 2024, 60(9): 79-89. (in Chinese)
DOI |
|
[12] | LI Y T, FAN Q S, HUANG H S, et al. A modified YOLOv8 detection network for UAV aerial image recognition[J]. Drones, 2023, 7(5): 304. |
[13] |
吴明杰, 云利军, 陈载清, 等. 改进YOLOv5s的无人机视角下小目标检测算法[J]. 计算机工程与应用, 2024, 60(2): 191-199.
DOI |
WU M J, YUN L J, CHEN Z Q, et al. Improved YOLOv5s small object detection algorithm in UAV view[J]. Computer Engineering and Applications, 2024, 60(2): 191-199. (in Chinese)
DOI |
|
[14] | WANG G, CHEN Y F, AN P, et al. UAV-YOLOv8: a small-object-detection model based on improved YOLOv8 for UAV aerial photography scenarios[J]. Sensors, 2023, 23(16): 7190. |
[15] | BOCHKOVSKIY A, WANG C Y, LIAO H Y M. YOLOv4: optimal speed and accuracy of object detection[EB/OL]. [2024-04-12]. https://arxiv.org/abs/2004.10934. |
[16] | WANG C Y, LIAO H Y M, WU Y H, et al. CSPNet: a new backbone that can enhance learning capability of CNN[C]// 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops. New York: IEEE Press, 2020: 390-391. |
[17] | WANG C Y, BOCHKOVSKIY A, LIAO H Y M. YOLOv7: trainable bag-of-freebies sets new state-of-the-art for real-time object detectors[C]// 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition. New York: IEEE Press, 2023: 7464-7475. |
[18] | LIN T Y, DOLLÁR P, GIRSHICK R, et al. Feature pyramid networks for object detection[C]// 2017 IEEE Conference on Computer Vision and Pattern Recognition. New York: IEEE Press, 2017: 2117-2125. |
[19] | LIU S, QI L, QIN H F, et al. Path aggregation network for instance segmentation[C]// 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. New York: IEEE Press, 2018: 8759-8768. |
[20] | GE Z, LIU S T, WANG F, et al. YOLOX: exceeding YOLO series in 2021[EB/OL]. [2024-04-12]. https://arxiv.org/abs/2107.08430. |
[21] | KONG T, SUN F C, LIU H P, et al. FoveaBox: beyound anchor-based object detection[J]. IEEE Transactions on Image Processing, 2020, 29: 7389-7398. |
[22] | CHEN J R, KAO S H, HE H, et al. Run, don't walk: chasing higher FLOPS for faster neural networks[C]// 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition. New York: IEEE Press, 2023: 12021-12031. |
[23] | WANG C Y, YEH I H, LIAO H Y M. YOLOv9: learning what you want to learn using programmable gradient information[EB/OL]. [2024-04-12]. https://arxiv.org/abs/2402.13616. |
[24] | CHOLLET F. Xception: deep learning with depthwise separable convolutions[C]// 2017 IEEE Conference on Computer Vision and Pattern Recognition. New York: IEEE Press, 2017: 1251-1258. |
[25] | TIAN Z, SHEN C H, CHEN H, et al. FCOS: a simple and strong anchor-free object detector[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, 44(4): 1922-1933. |
[26] | TAN M X, PANG R M, LE Q V. EfficientDet: scalable and efficient object detection[C]// 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition. New York: IEEE Press, 2020: 10781-10790. |
[27] | YANG G Y, LEI J, ZHU Z K, et al. AFPN: asymptotic feature pyramid network for object detection[C]// 2023 IEEE International Conference on Systems, Man, and Cybernetics. New York: IEEE, 2023: 2184-2189. |
[28] | 刘子洋, 徐慧英, 朱信忠, 等. Bi-YOLO: 一种基于YOLOv8n改进的轻量化目标检测算法[J]. 计算机工程与科学, 2024, 46(8): 1444-1454. |
LIU Z Y, XU H Y, ZHU X Z, et al. Bi-YOLO: an improved lightweight object detection algorithm based on YOLOv8n[J]. Computer Engineering & Science, 2024, 46(8): 1444-1454. (in Chinese) | |
[29] |
朱强军, 胡斌, 汪慧兰, 等. 基于轻量化YOLOv8s交通标志的检测[J]. 图学学报, 2024, 45(3): 422-432.
DOI |
ZHU Q J, HU B, WANG H L, et al. Detection of traffic signs based on lightweight YOLOv8s[J]. Journal of Graphics, 2024, 45(3): 422-432. (in Chinese)
DOI |
[1] | LI Zhenfeng, FU Shichen, XU Le, MENG Bo, ZHANG Xin, QING Jianjun. Research on gangue target detection algorithm based on MBI-YOLOv8 [J]. Journal of Graphics, 2024, 45(6): 1301-1312. |
[2] | HU Fengkuo, YE Lan, TAN Xianfeng, ZHANG Qinzhan, HU Zhixin, FANG Qing, WANG Lei, MAN Xiaofeng. A refined YOLOv8-based algorithm for lightweight pavement disease detection [J]. Journal of Graphics, 2024, 45(5): 892-900. |
[3] | WANG Yaru, FENG Lilong, SONG Xiaoke, QU Zhuo, YANG Ke, WANG Qianming, ZHAI Yongjie. TFD-YOLOv8: a transmission line foreign object detection method [J]. Journal of Graphics, 2024, 45(5): 901-912. |
[4] | LIU Yiyan, HAO Tingnan, HE Chen, CHANG Yingjie. Photovoltaic cell surface defect detection based on DBBR-YOLO [J]. Journal of Graphics, 2024, 45(5): 913-921. |
[5] | WU Peichen, YUAN Lining, HU Hao, LIU Zhao, GUO Fang. Video anomaly detection based on attention feature fusion [J]. Journal of Graphics, 2024, 45(5): 922-929. |
[6] | LIU Li, ZHANG Qifan, BAI Yuang, HUANG Kaiye. Research on multi-scale remote sensing image change detection using Swin Transformer [J]. Journal of Graphics, 2024, 45(5): 941-956. |
[7] | ZHANG Dongping, WEI Yangyue, HE Shuji, XU Yunchao, HU Haimiao, HUANG Wenjun. Feature fusion and inter-layer transmission: an improved object detection method based on Anchor DETR [J]. Journal of Graphics, 2024, 45(5): 968-978. |
[8] | LI Gang, CAI Zehao, SUN Huaxun, ZHAO Zhenbing. Research on defect detection of transmission line fittings based on improved YOLOv8 and semantic knowledge fusion [J]. Journal of Graphics, 2024, 45(5): 979-986. |
[9] | LI Jianhua, HAN Yu, SHI Kaiming, ZHANG Kejia, GUO Hongling, FANG Dongping, CAO Jiaming. Small-target worker detection on construction sites [J]. Journal of Graphics, 2024, 45(5): 1040-1049. |
[10] | LUO Zhihui, HU Haitao, MA Xiaofeng, CHENG Wengang. A network based on the homogeneous middle modality for cross-modality person re-identification [J]. Journal of Graphics, 2024, 45(4): 670-682. |
[11] | HU Xin, CHANG Yashu, QIN Hao, XIAO Jian, CHENG Hongliang. Binocular ranging method based on improved YOLOv8 and GMM image point set matching [J]. Journal of Graphics, 2024, 45(4): 714-725. |
[12] | NIU Weihua, GUO Xun. Rotating target detection algorithm in ship remote sensing images based on YOLOv8 [J]. Journal of Graphics, 2024, 45(4): 726-735. |
[13] | ZENG Zhichao, XU Yue, WANG Jingyu, YE Yuanlong, HUANG Zhikai, WANG Huan. A water surface target detection algorithm based on SOE-YOLO lightweight network [J]. Journal of Graphics, 2024, 45(4): 736-744. |
[14] | WU Bing, TIAN Ying. Research on multi-scale road damage detection algorithm based on attention mechanism [J]. Journal of Graphics, 2024, 45(4): 770-778. |
[15] | ZHU Qiangjun, HU Bin, WANG Huilan, WANG Yang. Detection of traffic signs based on lightweight YOLOv8s [J]. Journal of Graphics, 2024, 45(3): 422-432. |
Viewed | ||||||
Full text |
|
|||||
Abstract |
|
|||||