细节与全局多级融合的红外无人机目标检测

Infrared UAV Object Detection Based on Detailed and Global Multistage Fusion

  • 摘要: 基于红外无人机平台的目标检测技术在国防军事和应急救援领域具有广泛应用,但面临尺寸小和背景复杂等挑战,目标上的细节特征易被忽视。构建细节增强网络和多尺度跨空间注意力网络,通过在特征提取过程中增强低分辨率细节,以及特征融合过程中同时捕获局部和全局的空间语义信息,提升检测精度。首先,细节增强网络从输入图像中提取不同尺度的特征图,这些特征图尽可能多地增强原始图像中的细节信息。其次,多尺度跨空间注意力网络同时捕捉全局和局部的语义特征,用于多尺度特征融合。最后,检测头采用了考虑框之间顶点对距离的边界框回归损失函数,度量框之间的相似性。实验结果表明,所提模型DM-IUAV(l)的mAP@0.5在Drone_Vehicle中的红外数据集为81.2%,在HIT-UAV数据集上为74.4%。

     

    Abstract: The technology of infrared target detection using drones has wide applications in the fields of national defense and emergency rescue. However, it faces challenges such as the small size of relevant targets and complex backgrounds, which makes it easy to miss the detailed features of a target. By constructing detail enhancement and multiscale cross-space attention network models, detection accuracy can be improved by enhancing low-resolution details in the feature extraction process and capturing both local and global spatial semantic information in the feature fusion process. First, the detail enhancement network extracts feature maps of different scales from the input image to enhance detail information in the original image as much as possible. Second, the multiscale cross-space attention network captures both global and local semantic features simultaneously and performs multiscale feature fusion. Finally, the detection head uses a bounding box regression loss function that measures the similarity between adjacent boxes, considering the distance between the vertices of the bounding boxes. The experimental results show that the mAP@0.5 of the proposed model DM-IUAV(l) on the infrared dataset of Drone_Vehicle was 81.2%, with a value of 74.4% on the HIT-UAV dataset.

     

/

返回文章
返回