基于多尺度特征增强的红外弱小目标检测方法

向强; 路锦正; 缪玲; 胡锟; 郑超权

基于多尺度特征增强的红外弱小目标检测方法

Infrared Dim-Small Target Detection Method Based on Multi-Scale Feature Enhancement

摘要

摘要: 红外弱小目标检测技术在防空、目标制导以及航空航天等领域具备重要的实际应用价值与深远意义。本研究针对红外弱小目标数据本身信息受限、背景复杂且目标尺寸微小等挑战，提出了一种基于多尺度特征增强的红外弱小目标检测方法。以YOLOv8网络模型为基准，为了在其骨干部分充分保留与提取更多小目标的信息，同时有效抑制目标周围的噪声干扰，设计了一种最大池化结合U型结构空洞卷积（max-pooling and u-shaped dilated convolution，MUDC）；设计了额外的颈部模块-信息聚集与重参数化模块（gathering and reconstruction parameterization，GREP），以实现对骨干中多层特征和深层特征的充分融合与增强；为了使红外数据中不同尺度目标获得更多关注度与背景区分能力，引入了通道先验卷积注意力并融入C2f结构中。实验结果显示，在经过增广的NUAA-SIRST数据集上，本研究提出的模型相较于YOLOv8n模型，在精确率P、平均均值精度mAP₅₀以及mAP_50-95等评价指标上分别得到了13.7%、8.7%与5.6%的提升。相较于其他先进方法，本文模型的检测性能达到最优水平。

Abstract: Infrared dim-small target detection technology has significant practical value and importance in applications such as air defense, target guidance, and aerospace. This study addresses key challenges, including the limited information content of infrared dim small targets, complex background interference, and small target sizes, by proposing a multi-scale feature enhancement-based detection method for infrared dim small target detection. Using the YOLOv8 network model as a baseline, a U-shaped structure that integrates pooling operations with dilated convolutions is designed within the backbone to fully retain and extract small-target features while effectively suppressing surrounding noise. In addition, an enhanced neck module, namely, the Information Gathering and Reparameterization module, is introduced to enable sufficient fusion and enhancement of multi-level and deep features extracted by the backbone. To enhance attention to targets of varying scales and improve their discrimination from complex backgrounds in infrared data, a channel-prior convolutional attention mechanism is introduced and integrated into the C2f structure. Experimental results on the augmented NUAA-SIRST dataset show that, compared with the YOLOv8n model, the proposed model achieves improvements of 13.7%, 8.7%, and 5.6% in precision(P), average precision (mAP₅₀), and mAP_50-95, respectively. Furthermore, in comparison with other state-of-the-art methods, the proposed model attains superior detection performance, reaching an optimal level.

HTML全文

参考文献(27)

施引文献

资源附件(0)