Welcome to Journal of Graphics share: 

Journal of Graphics ›› 2023, Vol. 44 ›› Issue (4): 658-666.DOI: 10.11996/JG.j.2095-302X.2023040658

Previous Articles     Next Articles

Small object detection algorithm in UAV image based on feature fusion and attention mechanism

LI Li-xia1(), WANG Xin2,1,3(), WANG Jun3, ZHANG You-yuan4   

  1. 1. School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin Guangxi 541010, China
    2. School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu Sichuan 610000, China
    3. School of Marine Engineering, Guilin University of Electronic Technology, Beihai Guangxi 536000, China
    4. School of Electronics and Information Engineering, Lanzhou Jiaotong University, Lanzhou Gansu 730070, China
  • Received:2022-11-18 Accepted:2023-01-18 Online:2023-08-31 Published:2023-08-16
  • Contact: WANG Xin (1976-), professor, Ph.D. His main research interests cover image processing, network information security, internet of things, data mining and other research, etc. E-mail:304379506@qq.com
  • About author:

    LI Li-xia (1995-), master student. Her main research interests cover image processing and object recognition. E-mail:20032202019@mails.guet.edu.cn

  • Supported by:
    Guangxi Science and Technology Major Project(AA19254016);Guangxi Graduate Student Innovation Project(YCSW2021174);Beihai City Science and Technology Planning Project(202082033);Beihai City Science and Technology Planning Project(202082023)

Abstract:

The task of detecting small objects in UAV aerial images is a formidable challenge due to their diminutive size and insufficient amount of feature information. To surmount this predicament, a multi-head attention mechanism was incorporated into the YOLOv5 backbone network in order to seamlessly integrate global feature information. As the network depth increased, the model tended to accentuate high-level semantic information at the expense of underlying detailed texture features vital for the detection of small objects. To address this issue, a shallow feature enhancement module was devised to acquire underlying feature information and augment small object feature information. Furthermore, a multi-level feature fusion module was developed to amalgamate feature information from different layers, thus enabling the network to dynamically adjust the weights of each output detection layer. Experimental results on the publicly available VisDrone2021 dataset demonstrated that the mean average precision of the proposed algorithm, attained a level of 45.7%, representing a 3.1% enhancement over the baseline YOLOv5 algorithm. Additionally, the proposed algorithm achieved a detection speed of 41 frames per second for high-resolution images, satisfying the requirement for real-time performance and exhibiting a noteworthy improvement in detection accuracy over other prevalent methods.

Key words: feature fusion, attention mechanism, UAV aerial imagery, small object detection, YOLOv5

CLC Number: