基于YOLOv10n的轻量化无人机航拍目标检测模型

廖晓娟; 唐锡雷

基于YOLOv10n的轻量化无人机航拍目标检测模型

Lightweight UAV Aerial Object Detection Model Based on YOLOv10n

摘要

摘要: 针对无人机航拍图像目标检测中计算资源受限和多尺度小目标检测难度大，基于YOLOv10n模型，提出一种轻量化检测模型YOLOv10n-CIG。首先，设计C2f-CW（C2f with convolutional wise）替换C2f模块，通过结合部分卷积和逐点卷积优化计算资源，提升推理速度并增强多尺度特征融合效果。其次，去除Backbone的最后一次下采样层，并改进SPPF为SPPF-IP（SPPF with involution parallel structure），以保留小目标的细粒度空间信息，进一步提高多尺度特征融合性能。最后，引入了基于组卷积的轻量化检测头GHead（GConv Head，GHead），通过优化组卷积参数，使得检测精度、模型大小与推理速度之间达到了平衡。实验结果表明，YOLOv10n-CIG模型相较于原YOLOv10n模型而言，在mAP50上提升了5.3%，在模型大小上减少了1.12 MB，在推理速度上，分别在Ubuntu和Jetson提升59 FPS和9 FPS。与当前主流算法相比，YOLOv10n-CIG在各项指标上综合表现较好。

Abstract: To address the challenges of limited computational resources and multiscale small object detection in drone aerial imagery, a lightweight detection model, YOLOv10n-CIG, is proposed based on the YOLOv10n architecture. First, the C2f-CW (C2f with Convolutional Wise) module is designed to replace the conventional C2f module. By combining Partial Convolution (PConv) and Pointwise Convolution (PWConv), this new module optimizes computational resources, accelerates inference speed, and enhances multi-scale feature fusion. Second, the last downsampling layer in the Backbone is removed, and the Spatial Pyramid Pooling Fast (SPPF) module is improved to SPPF with Involution Parallel Structure (SPPF-IP) to retain the fine-grained spatial information of small targets, further improving the multi-scale feature fusion performance. Finally, a lightweight detection head, GHead (GConv Head), based on Group Convolution (GConv), is introduced. By optimizing the parameters of the group convolution, a balance between the detection accuracy, model size, and inference speed is achieved. The experimental results indicate that compared with the original YOLOv10n model, the YOLOv10n-CIG model achieves a 5.3% improvement in mAP50, reduces the model size by 1.12 MB, and increases the inference speed by 59 FPS on Ubuntu and 9 FPS on Jetson. Compared with current mainstream algorithms, YOLOv10n-CIG exhibits superior overall performance across various metrics.

HTML全文

参考文献(16)

施引文献

资源附件(0)