轻量级红外小目标检测方法

Lightweight Infrared Small Target Detection Method

  • 摘要: 针对红外图像背景复杂、信噪比低、检测目标尺寸小和亮度弱等检测难点,提出一种基于YOLOv7s的轻量级红外小目标检测算法ISTD-YOLO(Infrared Small Target Detection-You Only Look Once)。首先,对YOLOv7s网络结构进行轻量化重构,分别将特征提取网络和特征融合网络重新调整,设计出一种三尺度轻量级网络架构,提高对小目标的检测性能;然后,采用VoV-GSCSP来取代模型颈部网络的ELAN-W模块,以降低计算成本和网络结构的复杂性,提高推理速度;其次,在颈部网络中引入一种无参注意力机制,增强局部上下文信息的关联性,更准确地提取目标的定位;最后,选用归一化高斯Wasserstein距离(Normalized Gaussian Wasserstein Distance, NWD)优化常用的IoU指标,来计算预测框与真实框之间的重叠关系,增强对小目标的定位和检测精度。实验结果表明,ISTD-YOLO可以有效改善检测效果,对比基线模型,在HIT-UAV与IDSAT数据集上的检测精度分别提高8.52%与4.77%;模型体积仅有21.8 MB,参数量减少69.8%,计算量下降17.6%;相较于当下主流算法,ISTD-YOLO在各方面指标均得到有效改善,能够实现对红外小目标的高质量检测。

     

    Abstract: Detecting small targets in infrared images is challenging owing to complex backgrounds, low signal-to-noise ratios, small target sizes, and weak brightness. To address these challenges, a lightweight infrared small target detection algorithm, Infrared Small Target Detection–You Only Look Once (ISTD–YOLO), is proposed based on YOLOv7s. The YOLOv7s network structure is reconstructed in a lightweight manner by adjusting the feature extraction and feature fusion networks, and a three-scale lightweight architecture is designed to improve the detection performance of small targets. Next, the VoV-GSCSP module is adopted to replace the ELAN-W module in the neck network, aiming to reduce computational cost and network complexity while improving inference speed. In addition, a non-parametric attention mechanism is incorporated into the neck network to strengthen local contextual correlations and enable more accurate target localization. Finally, the Normalized Gaussian Wasserstein Distance (NWD) is employed to optimize the commonly used IoU metric for calculating the overlap relationship between predicted box and the ground-truth boxes, thereby enhancing the accuracy of small target localization and detection. Experimental results demonstrate that ISTD-YOLO effectively improves detection performance. Compared with the baseline model, detection accuracy on the HIT-UAV and IDSAT datasets increased by 8.52% and 4.77%, respectively. The model volume is only 21.8 MB, with a 69.8% reduction in parameter count and a 17.6% decrease in computational complexity. Furthermore, compared with current mainstream algorithms, ISTD-YOLO significantly improves multiple performance indicators and achieves high-quality detection of small infrared targets.

     

/

返回文章
返回