高永奇, 袁志祥. 基于改进YOLOv5的水下废弃物红外检测算法[J]. 红外技术, 2024, 46(9): 994-1005.
引用本文: 高永奇, 袁志祥. 基于改进YOLOv5的水下废弃物红外检测算法[J]. 红外技术, 2024, 46(9): 994-1005.
GAO Yongqi, YUAN Zhixiang. Improved YOLOv5-based Underwater Infrared Garbage Detection Algorithm[J]. Infrared Technology , 2024, 46(9): 994-1005.
Citation: GAO Yongqi, YUAN Zhixiang. Improved YOLOv5-based Underwater Infrared Garbage Detection Algorithm[J]. Infrared Technology , 2024, 46(9): 994-1005.

基于改进YOLOv5的水下废弃物红外检测算法

Improved YOLOv5-based Underwater Infrared Garbage Detection Algorithm

  • 摘要: 针对水下废弃物红外目标检测中出现的检测目标边界细节模糊、图像质量低和存在各种不规则形状或损坏的覆盖物等问题,本文提出了一种基于YOLOv5的改进目标检测方法(EFDCD-YOLO)。在主干网络中选择InceptionNeXt网络,以增强模型的表达能力和特征提取能力。其次,在特征融合层中通过加入EffectiveSE注意力机制,自适应地学习特征通道的重要性,并进行选择性加权。采用可变形卷积替代原模型中的C3模块,使模型能够更好地感知目标的形状和细节信息。此外,将CARAFE算子替代上采样模块,增强对细粒度特征的表现能力,避免信息丢失。在损失函数方面,采用Focal-EIOU损失函数,以提高模型对目标定位和边界框回归的准确性。最后,引入DyHead替换YOLOv5中的头部,通过动态感受野机制和多尺度的特征融合方式,提升模型的准确性。将改进后的EFDCD-YOLO模型应用于水下废弃物红外目标检测,相比于YOLOv5模型,改进后的模型在准确率(P)、召回率(R)和平均精度(mAP)方面分别提升了21.4%、9.7%和13.6%。实验结果表明,EFDCD-YOLO能够有效地提升水下废弃物红外目标检测场景的性能,更好地满足水下废弃物红外目标检测的需求。

     

    Abstract: An improved object detection method (YOLO with EffectiveSE, Focal-EIOU, DCNv2, CARAFE, and DyHead) is proposed based on YOLOv5 to address issues in underwater waste infrared target detection, such as blurred boundary details, low image quality, and the presence of various irregular or damaged coverings. The InceptionNeXt network is selected as the backbone network to enhance the model's expressive power and feature extraction capability. Additionally, the EffectiveSE attention mechanism is introduced in the feature fusion layer to adaptively learn the importance of feature channels and selectively weight them. Deformable convolutions are used to replace the C3 module in the original model, enabling it to better perceive the shapes and details of the targets. Moreover, the CARAFE operator is employed to replace the upsampling module, thereby enhancing the representation ability of the fine-grained features and avoiding information loss. In terms of the loss function, the Focal-EIOU loss function is adopted to improve the accuracy of the model in target localization and bounding box regression. Finally, DyHead is introduced to replace the head of YOLOv5, thereby enhancing the model accuracy via dynamic receptive field mechanisms and multiscale feature fusion. The improved EFDCD-YOLO model is applied to underwater waste infrared target detection and compared to the YOLOv5 model. The model achieves a 21.4% improvement in precision (P), 9.7% improvement in recall (R), and 13.6% improvement in mean average precision (mAP). The experimental results demonstrate that EFDCD-YOLO effectively enhances the detection performance in underwater waste infrared target detection scenarios and effectively meets the requirements of underwater infrared target detection.

     

/

返回文章
返回