融合红外与可见光的双流架构矿井下目标检测算法

Object Detection Algorithm for Underground Targets in Mines with a Dual-Stream Architecture Fusing Infrared and Visible Light

  • 摘要: 强光扰、低光照等恶劣条件是矿井下目标检测的一大困扰,单一可见光的检测时常出现漏检或误检的情况,现有的可见光与红外图像融合算法往往无法更好地同时提取两种图像的特征。针对上述问题,提出了一种可见光与红外图像多尺度融合目标检测算法DF-YOLOv8。通过构建双流特征提取器架构,分别对低分辨率红外图像和可见光图像进行特征提取;采用双线性插值法对特征图进行上采样,通过通道注意力机制进行特征图的融合处理;引入融合加权特征损失函数和一致性损失策略,优化模型的适应性和鲁棒性。消融实验与对比实验的结果表明,采用上述方法,模型在自建数据集Coal-Mine Video上的平均精度均值(mAP)达到87.9%,相较于YOLOv7、YOLOv8分别提升6.0%、5.7%,检测速度达到67FPS,确保实时监测的需求。

     

    Abstract: Harsh conditions such as strong light interference and low illumination are major challenges for underground target detection in mines. Detection using only visible light often results in missed or false detections. Existing algorithms for fusing visible and infrared images often fail to effectively extract features from both types of images simultaneously. To address this issue, we propose a multi-scale fusion target detection algorithm, DF-YOLOv8, which combines visible and infrared images. By constructing a dual-stream feature extractor architecture, we separately extract features from low-resolution infrared and visible light images. We use bilinear interpolation for feature map upsampling and apply a channel attention mechanism for feature map fusion. The introduction of a fusion weighted feature loss function and a consistency loss strategy optimizes the model's adaptability and robustness. Results from ablation experiments and comparison experiments show that using the proposed method, the model achieves a mean average precision (mAP) of 87.9% on our self-built Coal-Mine Video dataset, representing an improvement of 6.0% and 5.7% compared to YOLOv7 and YOLOv8, respectively. Additionally, it meets real-time monitoring requirements with a detection speed of 67 FPS.

     

/

返回文章
返回