基于扩散模型的红外小目标检测

Diffusion Model for Infrared Small Target Detection

  • 摘要: 红外小目标检测作为一项复杂且关键的计算机视觉任务,面临着目标尺寸微小、对比度低、背景噪声干扰强烈及数据稀缺等多重挑战,这些问题极大地制约了检测精度与实时性。现有基于深度学习的算法大多基于分割范式,通过设计结构较深的编码器-解码器网络实现分割掩码的生成,由于缺乏足够的特征表示和学习能力,在应对各种复杂场景时检测精度较低。鉴于此,受启发于人工智能领域扩散模型技术所取得的巨大成功,本文提供了一种新的解决思路,将红外小目标检测问题描述为生成式任务,并提出了一个条件去噪网络diff-ISTD。该网络利用逐步去噪与重建优势,挖掘图像内在深层次统计特性,从而能够更精确地区分并捕获微弱且易于混淆的小目标特征。具体来说,该网络包含条件分支网络以及去噪分支网络,分别用于充分提取红外图像的先验知识和细化含有噪声的掩码。此外,本文还设计了一种并行双维自注意力计算(PDSA)模块,融合空间与通道维度分析,极大增强了模型对全局结构和局部细节的把握力,克服了由分辨率和环境多样性引起的目标模糊难题。综合实验结果显示,diff-ISTD在面对极端检测条件时,相比目前先进的分割方法,展现出卓越的性能与更高的检测效率,为克服小目标检测领域的长期挑战开辟了新路径。

     

    Abstract: Infrared small-target detection, a complex and critical task in computer vision, faces numerous challenges—including tiny target sizes, low contrast, severe background noise, and limited data availability. These factors significantly impair detection accuracy and real-time performance. Existing deep learning–based algorithms, which predominantly adopt segmentation paradigms via deep encoder–decoder architectures for generating segmentation masks, often exhibit limited precision in complex scenarios due to inadequate feature representation and learning capabilities. Inspired by the notable success of diffusion models in artificial intelligence, this paper introduces a novel approach by reframing infrared small-target detection as a generative task and proposes a conditional denoising network, termed diff-ISTD. By leveraging the strengths of progressive denoising and image reconstruction, diff-ISTD captures the deep statistical properties of infrared images, enabling more precise identification of weak and ambiguous small-target features. The proposed network consists of conditional branching modules for extracting prior knowledge from infrared inputs and denoising branches for refining noisy segmentation masks. In addition, a parallel dual-dimensional self-attention (PDSA) block is introduced to integrate spatial and channel information, significantly enhancing the model's sensitivity to global structures and local details. This design effectively addresses the challenges of target blurring caused by resolution limitations and environmental variability. Comprehensive experiments demonstrate that, under rigorous detection conditions, diff-ISTD outperforms current state-of-the-art segmentation methods in terms of performance and detection efficiency, offering a promising direction for advancing infrared small-target detection technologies.

     

/

返回文章
返回