YOLOv5-LR：一种遥感影像旋转目标检测模型

高明明; 李沅洲; 马雷; 南敬昌; 周芊邑

YOLOv5-LR：一种遥感影像旋转目标检测模型

YOLOv5-LR: A Rotating Object Detection Model for Remote Sensing Images

摘要

摘要: 真实遥感图像中，目标呈现任意方向分布的特点，原始YOLOv5网络存在难以准确表达目标的位置和范围、以及检测速度一般的问题。针对上述问题，提出一种遥感影像旋转目标检测模型YOLOv5-Left-Rotation，首先利用Transformer自注意力机制，让模型更加注意感兴趣的目标，并且在图像预处理过程中采用Mosaic数据增强，对后处理过程使用改进后的非极大值抑制算法Non-Maximum Suppression。其次，引入角度损失函数，增加网络的输出维度，得到旋转矩形的预测框。最后，在网络模型的浅层阶段，增加滑动窗口分支，来提高大尺寸遥感稀疏目标的检测效率。实验数据集为自制飞机数据集CASIA-plane78和公开的舰船数据集HRSC2016，结果表明，改进旋转目标检测算法相比于原始YOLOv5网络的平均精度提升了3.175%，在吉林一号某星推扫出的大尺寸多光谱影像中推理速度提升了13.6%，能够尽可能地减少冗余背景信息，更加准确检测出光学遥感图像中排列密集、分布无规律的感兴趣目标的区域。

Abstract: In a real remote sensing image, the target is distributed in any direction and it is difficult for the original YOLOv5 network to accurately express the location and range of the target and the detection speed is moderate. To solve these problems, a remote sensing image rotating target detection model, YOLOv5-Left-Rotation, was proposed. First, the transformer self-attention mechanism was used to make the model pay more attention to the targets of interest. In addition, Mosaic data were enhanced in the image preprocessing, and the improved Non-Maximum Suppression algorithm was used in post-processing. Second, an angle loss function was introduced to increase the output dimensions of the network, and the prediction box of the rotating rectangle was obtained. Finally, in the shallow stage of the network model, a sliding window branch was added to improve the detection efficiency of large-sized remote sensing sparse targets. The experimental datasets were the self-made aircraft dataset CASIA-plane78 and the public ship dataset HRSC2016. The results show that the average accuracy of the improved rotating target detection algorithm is improved by 3.175% compared with that of the original model, and the reasoning speed is improved by 13.6% in a large multispectral image swept by a Jilin-1 satellite. It can optimally reduce the redundant background information and more accurately detect the densely arranged and irregularly distributed areas of objects of interest in optical remote sensing images.

HTML全文

参考文献(23)

施引文献

资源附件(0)