面向红外小目标检测的多尺度特征融合算法设计

Design of Multi-scale Feature Fusion Algorithms for Infrared Small Target Detection

  • 摘要: 为解决红外小目标分辨率低、成像模糊,容易受噪声影响等检测难题,本文基于YOLOv8n提出了一种多尺度特征融合的红外小目标检测算法YOLO-IRTD( YOLO-InfraRed Tiny Detect)。算法首先对YOLOv8的特征融合网络进行重构,提出了一种特征聚焦扩散金字塔网络,增强了网络对小目标特征的提取能力;设计了自适应动态检测头,通过促进分类分支和定位分支之间的信息交互,在降低模型复杂度的同时提升检测精度;最后将SimAM注意力机制融入星型块中,设计出Star-SimAM模块,替换主干网络中的C2f模块,在降低参数量的同时进一步增强网络的特征提取能力。经实验验证,YOLO-IRTD模型参数量下降20%,权重大小仅为5MB,在HIT-UAV数据集上mAP高达90.4%,相比Base模型提升了3.6%,对比YOLO系列模型提升1%~4%,在FLIR数据集上mAP提升1.2%,泛化性能优异,模型各方面指标在目前主流检测模型中均处于领先地位。

     

    Abstract: To address the challenges of low resolution, blurry imaging, and noise susceptibility in infrared small target detection, this paper proposes a multi-scale feature fusion infrared small target detection algorithm, YOLO-IRTD (YOLO-InfraRed Tiny Detect), based on YOLOv8n. The algorithm first reconstructs the feature fusion network of YOLOv8 and introduces a feature-focused diffusion pyramid network to enhance the network’s ability to extract small target features. It also designs an adaptive dynamic detection head that promotes information interaction between the classification and localization branches, improving detection accuracy while reducing model complexity. Finally, the SimAM attention mechanism is integrated into the Star Block to form the Star-SimAM module, which replaces the C2f module in the backbone network, further enhancing feature extraction capabilities while reducing the model's parameter count. Experimental results show that the YOLO-IRTD model reduces the parameter count by 20%, with a weight size of only 5 MB. It achieves a mAP of 90.4% on the HIT-UAV dataset, improving by 3.6% over the Base model and outperforming YOLO series models by 1%-4%. On the FLIR dataset, the mAP improves by 1.2%. The model demonstrates excellent generalization performance, and its overall metrics are among the best in current mainstream detection models.

     

/

返回文章
返回