GAO Mingming, LI Yuanzhou, MA Lei, NAN Jingchang, ZHOU Qianyi. YOLOv5-LR: A Rotating Object Detection Model for Remote Sensing Images[J]. Infrared Technology , 2024, 46(1): 43-51.
Citation: GAO Mingming, LI Yuanzhou, MA Lei, NAN Jingchang, ZHOU Qianyi. YOLOv5-LR: A Rotating Object Detection Model for Remote Sensing Images[J]. Infrared Technology , 2024, 46(1): 43-51.

YOLOv5-LR: A Rotating Object Detection Model for Remote Sensing Images

More Information
  • Received Date: November 17, 2022
  • Revised Date: December 29, 2022
  • In a real remote sensing image, the target is distributed in any direction and it is difficult for the original YOLOv5 network to accurately express the location and range of the target and the detection speed is moderate. To solve these problems, a remote sensing image rotating target detection model, YOLOv5-Left-Rotation, was proposed. First, the transformer self-attention mechanism was used to make the model pay more attention to the targets of interest. In addition, Mosaic data were enhanced in the image preprocessing, and the improved Non-Maximum Suppression algorithm was used in post-processing. Second, an angle loss function was introduced to increase the output dimensions of the network, and the prediction box of the rotating rectangle was obtained. Finally, in the shallow stage of the network model, a sliding window branch was added to improve the detection efficiency of large-sized remote sensing sparse targets. The experimental datasets were the self-made aircraft dataset CASIA-plane78 and the public ship dataset HRSC2016. The results show that the average accuracy of the improved rotating target detection algorithm is improved by 3.175% compared with that of the original model, and the reasoning speed is improved by 13.6% in a large multispectral image swept by a Jilin-1 satellite. It can optimally reduce the redundant background information and more accurately detect the densely arranged and irregularly distributed areas of objects of interest in optical remote sensing images.
  • [1]
    ZHANG X, CHEN G, LI X, et al. Multi-oriented rotation-equivariant network for object detection on remote sensing images[J]. IEEE Geoscience and Remote Sensing Letters, 2022, 19: 1-5.
    [2]
    WANG Yi, Syed M A B, Mahrukh K, et al. Remote sensing image super-resolution and object detection: Benchmark and state of the art[J]. Expert Systems with Applications, 2022, 197: 116793. DOI: 10.1016/j.eswa.2022.116793
    [3]
    XI Y Y, JI L Y, YANG W T, et al. Multitarget detection algorithms for multitemporal remote sensing data[J]. IEEE Transactions on Geoscience and Remote Sensing, 2022, 60: 1-15.
    [4]
    WANG Y Q, MA L, TIAN Y. State-of-the-art of ship detection and recognition in optical remotely sensed imagery[J]. Acta Automatica Sinica, 2011, 37(9): 1029-1039.
    [5]
    Krizhevsky A, Sutskever I, Hinton G E. ImageNet classification with deep convolutional neural networks [J]. Communications of the ACM, 2017, 60(6): 84-90. DOI: 10.1145/3065386
    [6]
    WANG W, FU Y, DONG F, et al. Semantic segmentation of remote sensing ship image via a convolutional neural networks model[J]. IET Image Processing, 2019, 13(6): 1016-1022. DOI: 10.1049/iet-ipr.2018.5914
    [7]
    HE K, ZHANG X, REN S, et al. Spatial pyramid pooling in deep convolutional networks for visual recognition[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37(9): 1904-1916. DOI: 10.1109/TPAMI.2015.2389824
    [8]
    REN S, HE K, Girshick R, et al. Faster R-CNN: towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(6): 1137-1149. DOI: 10.1109/TPAMI.2016.2577031
    [9]
    FANG F, LI L, ZHU H, et al. Combining faster r-cnn and model-driven clustering for elongated object detection[J]. IEEE Transactions on Image Processing, 2020, 29: 2052-2065. DOI: 10.1109/TIP.2019.2947792
    [10]
    LIU W, Anguelov D, Erhan D, et al. SSD: single shot multibox detector[C]//Proc of the European Conference on Computer Vision, 2016: 21-37.
    [11]
    Redmon J, Divvala S, Girshick R, et al. You Only Look Once: unified, real time object detection[C]//Computer Vision and Pattern Recognition, 2017: 6517-6525.
    [12]
    Redmon J, Farhadi A. YOLO9000: Better, faster, stronger[C]//IEEE conference on Computer Vision and Pattern Recognition, 2017: 6517-6525.
    [13]
    Redmon J, Farhadi A. Yolov3: An incremental improvement[C]//IEEE conference on Computer Vision and Pattern Recognition, 2018, arXiv: 1804.0276.
    [14]
    Bochkovskiy A, Wang C Y, Liao H Y M. YOLOv4: Optimal Speed and Accuracy of Object Detection [C]//IEEE conference on Computer Vision and Pattern Recognition, 2020. arXiv: 2004.10934.
    [15]
    ZHU Wentao, LAN Xianchao, LUO Huanlin, et al. Remote sensing aircraft target detection based on improved faster R-CNN[J]. Computer Science, 2022, 49(6A): 378-383. DOI: 10.11896/jsjkx.210300121
    [16]
    LI D, ZHANG J. Rotating target detection for tarpaulin rope based on improved YOLOv5[C]// 5th International Conference on Artificial Intelligence and Big Data (ICAIBD), 2022: 299-303.
    [17]
    YANG X, YANG J R, YAN J C, et al. SCRDet: Towards more robust detection for small, cluttered and rotated objects[C]//2019 IEEE/CVF International Conference on Computer Vision (ICCV), 2019: 8232-8241.
    [18]
    WANG B R, LI M. A structure to effectively prepare the data for sliding window in deep learning[C]// IEEE 6th International Conference on Signal and Image Processing (ICSIP), 2021: 1025-2018.
    [19]
    Dosovitskiy A, Beyer L, Kolesnikov A, et al. An image is worth 16×16 words: transformers for image recognition at scale[J/OL]. Computer Science, 2010, https://arxiv.org/abs/2010.11929.
    [20]
    LAN Lingxiang, CHI Mingmin. Remote sensing change detection based on feature fusion and attention network[J]. Computer Science, 2022, 49(6): 193-198.
    [21]
    LIU Z, WANG H, WENG L, et al. Ship rotated bounding box space for ship extraction from high-resolution optical satellite images with complex backgrounds[J]. IEEE Geoscience & Remote Sensing Letters, 2017, 13(8): 1074.
    [22]
    LI Y, LI M, LI S, et al. Improved YOLOv5 for remote sensing rotating object detection[C]//6th International Conference on Communication, Image and Signal Processing (CCISP), 2021: 64-68.
    [23]
    Institute of Automation. Chinese Academy of Sciences Remote sensing artificial intelligence algorithm competition platform[EB/OL]. https://www.rsaicp.com/portal/dataDetail?id=34.
  • Related Articles

    [1]XU Shiwen, WANG Heng, ZHANG Hua, PANG Jie. Human Fall Detection Method Based on Key Points in Infrared Images[J]. Infrared Technology , 2021, 43(10): 1003-1007.
    [2]ZHANG Zhipeng, SHAO Xuejun, PANG Qing. Research on the Key Technology of 3D Laser Inverted Scanning[J]. Infrared Technology , 2021, 43(8): 752-756.
    [3]A Method of Object Tracking Based on Feature Point Matching[J]. Infrared Technology , 2016, 38(7): 597-601.
    [4]ZHAO De-li, ZHU You-pan, LI Yan, ZENG Bang-ze, PAN Chao, LUO Lin, WU Cheng. Investigation on Infrared and Low Light Level Image Registration Algorithm Based on Point Feature and Freeman Chain Code[J]. Infrared Technology , 2015, (6): 467-471.
    [5]ZHAO De-li, ZHU You-pan, WU Cheng, LI Ze-min, ZENG Bang-ze, LUO Lin, YANG Peng-wei, WANG Bing, LI Yan. Investigation on Improved Infrared Image Registration Algorithm Based on Point Feature and Gray Feature[J]. Infrared Technology , 2014, (10): 820-826.
    [6]YU Hong-sheng, JIN Wei-qi. SIFT Key-points Self-adaptive Extraction Algorithm for Video Images[J]. Infrared Technology , 2013, (12): 768-772.
    [7]YANG Li, YANG Hua. The Key Techniques and Applications of Infrared False Target[J]. Infrared Technology , 2006, 28(9): 531-534. DOI: 10.3969/j.issn.1001-8891.2006.09.009
    [8]ZHAO Qin, ZHOU Tao, SHU Qin. Discussion of Image Registration Based on Feature Points[J]. Infrared Technology , 2006, 28(6): 327-330. DOI: 10.3969/j.issn.1001-8891.2006.06.005
    [9]Study on the Key Techniques of the Imaging Infrared Guidance for AAM[J]. Infrared Technology , 2003, 25(4): 45-48. DOI: 10.3969/j.issn.1001-8891.2003.04.011
    [10]Modification of the Infrared Point Measurement for Temperature[J]. Infrared Technology , 2002, 24(3): 49-51,55. DOI: 10.3969/j.issn.1001-8891.2002.03.013
  • Cited by

    Periodical cited type(2)

    1. 邢志坤. 基于LabVIEW的变电站移动机器人轨迹跟踪虚拟仿真系统设计. 自动化与仪表. 2024(07): 67-71 .
    2. 李辉,余大成,陈耀. 基于OWA算子和CWAA算子的变电站巡视周期优化. 广西电力. 2024(05): 50-54 .

    Other cited types(1)

Catalog

    Article views PDF downloads Cited by(3)
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return