空间压缩与通道增强融合的小目标检测网络研究

陆许明; 谭显洋; 余志权

空间压缩与通道增强融合的小目标检测网络研究

Small Object Detection Network with Spatial Compression and Channel Enhancement Fusion

摘要

摘要: 针对现有遥感小目标检测算法存在对于小目标的检测精度低，会出现漏检和误检的问题，提出基于YOLOv8n改进的航拍小目标检测算法，同时借助SAHI（Slice-based Attention Hyper-Inference）切片辅助推理技术来增强对小目标的检测效果。该算法通过优化网络主干，增强高分辨率特征提取，并改进颈部结构，以提高对小目标的感知能力。为了突出目标特征同时可以抑制背景干扰的影响，利用基于感受野注意力机制RFA（Reception-Filed Attention）和CBAM（Convolutional Block Attention Module）来优化卷积模块以强调感受野的空间特征来强化对小目标特征的学习能力。最后结合SPD（space-to-depth）模块重新设计C2f构成新的FREM（Feature Reorganization and Enhancement Module）模块，减少细粒度信息的损失，有效聚合局部的特征信息。在VisDrone2019数据集上进行实验，改进模型mAP₅₀相比原YOLOv8n显著提升了9.2%，并且比目前其他主流网络有较高的精确度。同时还在Tinyperson数据集和Widerperson数据集上做了泛化通用性实验。经实验表明，优化后的模型相比原模型在小目标检测精度上有显著提升。

Abstract: To overcome the low detection accuracy of small objects in existing remote-sensing target-detection algorithms, which leads to missed and false detections, an improved aerial small target detection algorithm based on YOLOv8n is proposed. This algorithm was enhanced using the SAHI slicing auxiliary inference technique to enhance the detection of small targets. By optimizing the network backbone to enhance the high-resolution feature-extraction and refine the neck structure, the algorithm enhances the perception of small targets. To emphasize the target features while mitigating the impact of background interference, the convolutional module was optimized using receptive-field attention and convolutional block attention module mechanisms, which highlight the spatial characteristics of the receptive field to strengthen the learning capability for small target features. The C2f was reengineered with a space-to-depth module to create a new feature reorganization and enhancement module, reducing the loss of fine-grained information and effectively aggregating local feature information. Experiments on the VisDrone2019 dataset show that the improved model increased the mAP₅₀ by 9.2% compared to the original YOLOv8n and outperformed other mainstream networks in terms of accuracy. Generalization experiments were conducted using the Tinyperson and Widerperson datasets. The results indicate that the optimized model significantly improved the detection precision for small targets compared to the original model.

HTML全文

参考文献(36)

施引文献

资源附件(0)