基于改进YOLOv5s的车载红外图像目标检测算法

姜宇; 刘冉冉; 李杭宇; 李峰; 郭威

基于改进YOLOv5s的车载红外图像目标检测算法

Vehicle-Infrared-Image Target Detection Algorithm Based on Improved YOLOv5s

摘要

摘要: 车载红外图像可以在夜间和恶劣天气下帮助驾驶员识别道路上的行人和其他车辆，减少交通事故的发生。针对YOLOv5s算法对车辆红外图像检测准确率低的问题，提出了一种改进YOLOv5s的车载红外图像目标检测算法。首先设计出一种感受野增强结构RFENeck模块，通过替换C3中的BottleNeck增强特征融合网络感受野区域，提高检测精度。其次，采用一种结合注意力机制的动态目标检测头，提升检测头的表达能力。最后，为了消除改进导致的模型大小的增加，使用一种结合CNN（Convolutional Neural Network）和Transformer级联设计的高效倒残差移动模块组成主干网络，在准确率不降低的同时减少网络参数量和计算量。实验结果显示，改进算法相较YOLOv5s平均检测精度从82.9%提升到85.0%，运算量减少了5.7%，模型权重减少了0.4 M，满足模型大小与精度的需求。

Abstract: In-vehicle infrared images can help drivers identify pedestrians and other vehicles on the road at night and during bad weather, thereby reducing traffic accidents. To address the low detection accuracy of vehicle infrared images using the YOLOv5s algorithm, an improved YOLOv5s algorithm for vehicle infrared image target detection is proposed. First, a receptive field enhancement structure, namely an RFENeck module, is designed, which replaces the BottleNeck module in C3, to enhance the receptive field area of the feature fusion network and thus improve the detection accuracy. Second, a dynamic object detection head, combined with an attention mechanism, is used to improve the expression ability of the detection head. Finally, to eliminate the increase in model size caused by the improvement, an efficient backward residual mobile module, combined with the cascade designs of convolutional neural networks and Transformer, is used to form the backbone network. This module can reduce the number of network parameters and calculation steps without reducing the accuracy. The experimental results show that compared to YOLOv5s, the average detection accuracy of the improved algorithm increases from 82.9% to 85.0%; in addition, the computation amount is reduced by 5.7%, and the model weight is reduced by 0.4 M. These results indicate that the proposed algorithm fulfils the requirements of model size and accuracy.

HTML全文

参考文献(20)

施引文献

资源附件(0)