SR-Unet:用于红外小目标检测的切分滚动网络

SR-Unet: A Split Rolling Network for Infrared Small Target Detection

  • 摘要: 针对目前基于CNN的红外小目标检测方法难以捕获远程依赖关系,而基于Transformer的方法计算复杂度高,局部特征学习效果差的问题,为了有效提取并融合局部特征和远程依赖关系,本文提出一种结合MLP与CNN的红外小目标检测网络算法SR-Unet(Split Rolling-Unet)。该算法在Rolling-Unet的基础上,添加多尺度深度监督融合,通过构建多方向切分正交滚动MLP和局部信息提取模块,在捕获多方向远程依赖的同时对局部上下文信息进行整合。论文在公开数据集NUAA-SIRST和NUDT-SIRST上进行对比实验,结果表明SR-Unet参数量仅有2.01M,且多个评估指标优于目前主流的红外小目标检测算法。通过消融实验表明,改进后算法的IoU由0.7463提升到0.7851,F1由0.8547提升到0.8796,Pd由93.92%提升到96.2%。SR-Unet在红外小目标检测上具有更高的检测精度,且检测概率大,综合性能好。

     

    Abstract: To address the challenges faced by current CNN-based infrared small-target detection methods in capturing long-range dependencies, as well as the high computational complexity and poor local feature learning of transformer-based methods, this study proposes an infrared small-target detection network algorithm called Split Rolling-Unet (SR-Unet) that combines MLP and CNN. Based on the Rolling-Unet, this algorithm adds a multiscale deep supervision fusion. By constructing the MSORMLP and LIEM, multidirectional long-range dependencies were captured while integrating the local context information. Comparative experiments were conducted on the public NUAA-SIRST and NUDT-SIRST datasets. The results demonstrate that SR-Unet contains only 2.01 M parameters while outperforming current mainstream infrared small-target detection algorithms across multiple evaluation metrics. Ablation experiments showed that the improved algorithm increased IoU from 0.7463 to 0.7851, F1 score from 0.8547 to 0.8796, and Pd from 93.92% to 96.2%. SR-Unet demonstrates higher detection accuracy, a higher detection probability, and overall superior performance in infrared small-target detection tasks.

     

/

返回文章
返回