结合交叉注意力的红外小目标检测

Infrared Small Object Detection with Cross-attention

  • 摘要: 红外小目标检测在军事侦察、航空航天监测及安防等领域具有重要意义,特别是在低照度及复杂气候环境下,利用红外热成像技术可以精准捕捉并识别远方微小且信号弱的目标。针对红外场景下小目标特征微弱、检测精度低等问题,提出了一种改进的CTCA-YOLOv8算法。首先增加了小目标层并删除针对大目标的检测层,使得在显著降低参数量的情况下改善小目标的检测效果,然后分别添加BoT模块、跨层特征交互融合模块DFF以及交叉注意力机制CTCA,通过获取全局特征信息和融合不同层级的特征,CTCA-YOLOv8针对红外小目标具有更好检测效果。实验结果表明在FLIRv2数据集上,相比YOLOv8n模型,CTCA-YOLOv8的mAP50指标提升了5.3%,计算量增加了3.5GFLOPs,参数量相比YOLOv8n下降了37%,CTCA-YOLOv8在仍能保持轻量化的前提下,目标检测精确度得到了有效提升。

     

    Abstract: Detecting small objects by their infrared signatures is of considerable importance in military reconnaissance, aerospace monitoring, and security, especially in low-light and complex weather conditions. Leveraging infrared thermal imaging technology enables relatively precise detection and identification of distant, faint targets. An enhanced CTCA-YOLOv8 algorithm is proposed to address the challenges of weak characteristics and low detection accuracy for small targets in infrared scenes. First, a layer is added to detect small targets while removing detection layers tailored for large targets to enhance detection of small targets while significantly reducing the parameter count of the model. Subsequently, BoT module, DFF and CTCA are incorporated to acquire global feature information and fuse features across different hierarchies. CTCA-YOLOv8 exhibited superior performance in terms of detecting small infrared targets. Experimental results on the FLIRv2 dataset demonstrate a 5.3% improvement in mAP50 metric compared to YOLOv8n model, with an increase in computational complexity of 3.5GFLOPs and a 37% reduction in parameters compared to YOLOv8n. CTCA-YOLOv8 achieved enhanced object detection accuracy while maintaining a lightweight design.

     

/

返回文章
返回