基于多注意力机制的红外与可见光图像夜间目标检测

Nighttime Object Detection in Infrared and Visible Images Based on Multi-Attention Mechanism

  • 摘要: 目标检测一直是计算机视觉领域的研究热点,YOLO系列目标检测模型已广泛应用于多个领域。然而,目前关于目标检测的图像数据大多是基于单一类型传感器,难以完整地表征成像场景,且检测到的目标所包含有用信息具有局限性,尤其是在低照度、夜晚、雨雾等条件下,目标检测更加困难。为了更好地检测夜间目标,本文提出了一种结合CBAM注意力机制与Transformer的多注意力机制的红外与可见光图像夜间目标检测方法,通过添加Transformer来获取丰富的局部和上下文信息,通过添加CBAM注意力机制来减少误检。为了验证方法的有效性,本文选取了5种当前主流的目标检测算法在公开红外目标检测数据集上进行测试,本文方法与原始YOLO v7相比,mAP从62.6%提升至71.5%。本文还制作了一个用于夜间目标检测红外-可见光融合目标检测数据集。在该数据集上与原始YOLOv7相比,mAP从79.90%提升至94.80%,效果非常显著。

     

    Abstract: Object detection has long been a research hotspot in the field of computer vision, and the YOLO series of object detection models is widely used in numerous fields. However, most current image data for object detection are based on a single type of sensor, which makes it difficult to fully characterize the imaging scene. The detected objects contain limited useful information, especially under conditions of low illumination, night, rain, and fog. To improve nighttime object detection, our study proposed a multi-attention mechanism for infrared and visible images. This mechanism combines the CBAM attention mechanism with a Transformer to obtain rich local and contextual information and reduce false detections. To verify the effectiveness of the method, five current mainstream object detection algorithms were selected and tested on a public infrared object detection dataset. The mAP of the proposed method improved from 62.6% to 71.5% compared to the original YOLOv7. This study also produced an infrared–visible fusion dataset for nighttime object detection. On this dataset, the mAP improved significantly from 79.90% to 94.80% compared to the original YOLOv7.

     

/

返回文章
返回