基于YOLO v8的航拍图像小目标检测算法

Small-Target Detection Algorithm for Aerial Images Based on YOLO v8

  • 摘要: 针对航拍图像中小目标存在分辨率低、密集、背景相似等问题,本文提出航拍小目标检测算法TINY-YOLO v8。首先,为提高对小目标的检测能力,在YOLO v8模型基础上增加一个小目标检测层,并优化主干部分网络结构来提高精度;其次,将主干网络中部分卷积替换为轻量级卷积,并将颈部网络中C2f模块替换为轻量级模块VoV-GSCSP来减少模型参数量和计算量;然后,在网络结构中引入ECA注意力机制,增强特征图在通道层面上的信息交互能力,使探测头提取到更为细腻的目标特征;最后,设计Inner-MPDIoU损失函数来更好地加速模型收敛状况,提升对目标的定位能力。实验结果表明,本文提出的TINY-YOLO v8算法相比于其它目标检测算法有着更为出色的检测效果。与基准算法对比,本文提出的模型在VisDrone数据集和TinyPerson数据集上mAP50分别提高5.2%、8%,且参数量相较于原模型下降10.1%。因此,本文所提出的算法可以很好地运用在航拍图像检测领域中。

     

    Abstract: Aiming at the challenges of low resolution, high target density, and background similarity in aerial small-target detection, this study proposes an improved detection algorithm, TINY-YOLOv8. First, to enhance the detection capability for small targets, an additional small-target detection layer is incorporated into the YOLOv8 model, and the backbone network structure is optimized to improve detection accuracy. Second, partial convolutions in the backbone network are replaced with lightweight convolutions, and the C2f module in the neck is substituted with a lightweight VoV-GSCSP module to reduce model parameters and computational complexity. Furthermore, the ECA attention mechanism is integrated into the network to strengthen channel-wise feature interactions, enabling the detection head to extract more delicate target features. Finally, an Inner-MPDIoU loss function is designed to accelerate model convergence and enhance target localization accuracy. Experimental results show that the proposed TINY-YOLOv8 algorithm outperforms other target detection methods. Compared with the benchmark model, the proposed approach improves mAP50 by 5.2% and 8% on the VisDrone and TinyPerson datasets, respectively, while reducing the number of parameters by 10.1%. These results indicate that the proposed algorithm is well-suited for aerial image target detection applications.

     

/

返回文章
返回