CHEN Xiaohan, XU Yuanyuan. Infrared Multi-Scale Target Detection Algorithm Based on RCR-YOLO[J]. Infrared Technology , 2025, 47(4): 459-467.
Citation: CHEN Xiaohan, XU Yuanyuan. Infrared Multi-Scale Target Detection Algorithm Based on RCR-YOLO[J]. Infrared Technology , 2025, 47(4): 459-467.

Infrared Multi-Scale Target Detection Algorithm Based on RCR-YOLO

More Information
  • Received Date: February 28, 2024
  • Revised Date: March 31, 2024
  • Infrared target detection has been widely used in both military and civilian fields. To address the issues of missed and false detections in infrared multi-scale target detection under complex backgrounds, an improved YOLOv5s algorithm, RCR-YOLO, is proposed in this paper. First, the backbone network CSPDarkNet53 of the original YOLOv5s was replaced with ResNet50 to avoid gradient vanishing caused by the deep network and to enhance the network's feature extraction capability. Subsequently, the CA attention mechanism module was added to the end of the backbone to capture feature information from different locations. Finally, the Res2Net module was added to the neck network to improve the network's representational ability and process multi-scale feature information by introducing a multi-branch structure and progressively increasing resolution, thereby enhancing detection performance. Experimental results show that this method outperforms mainstream target detection algorithms such as Faster R-CNN, SSD, and YOLOv3. Compared to YOLOv5s, mAP50–95 increased by 1.1%, while mAP50 remained at 99.5%, indicating better detection performance. The algorithm effectively performs multi-scale infrared target detection under complex backgrounds.

  • [1]
    LI K, WANG J, Jalil H, et al. A fast and lightweight detection algorithm for passion fruit pests based on improved YOLOv5[J]. Computers and Electronics in Agriculture, 2023, 204: 107534. DOI: 10.1016/j.compag.2022.107534
    [2]
    ZHANG Y, GUO K. Power plant indicator light detection system based on improved YOLOv5[J]. Journal of Beijing Institute of Technology, 2022, 31(6): 605-612.
    [3]
    YANG H, FANG Y, LIU L, et al. Improved YOLOv5 based on feature fusion and attention mechanism and its application in continuous casting slab detection[J]. IEEE Transactions on Instrumentation and Measurement, 2023.
    [4]
    ZHONG S, ZHOU H, MA Z, et al. Multiscale contrast enhancement method for small infrared target detection[J]. Optik, 2022, 271: 170134. DOI: 10.1016/j.ijleo.2022.170134
    [5]
    贺顺, 谢永妮, 杨志伟, 等. 基于IHBF的增强局部对比度红外小目标检测方法[J]. 红外技术, 2022, 44(11): 1132-1138. http://hwjs.nvir.cn/cn/article/id/0f2609dc-79df-467e-ac1d-4d5f888850d1

    HE Shun, XIE Yongni, YANG Zhiwei, et al. IHBF-based enhanced local contrast measure method for infrared small target detection[J]. Infrared Technology, 2022, 44(11): 1132-1138. http://hwjs.nvir.cn/cn/article/id/0f2609dc-79df-467e-ac1d-4d5f888850d1
    [6]
    JIANG C, REN H, YE X, et al. Object detection from UAV thermal infrared images and videos using YOLO models[J]. International Journal of Applied Earth Observation and Geoinformation, 2022, 112: 102912. DOI: 10.1016/j.jag.2022.102912
    [7]
    CAO S, WANG T, LI T, et al. UAV small target detection algorithm based on an improved YOLOv5s model[J]. Journal of Visual Communication and Image Representation, 2023, 97: 103936. DOI: 10.1016/j.jvcir.2023.103936
    [8]
    LIU Z, GAO X, WAN Y, et al. An improved YOLOv5 method for small object detection in UAV capture scenes[J]. IEEE Access, 2023, 11: 14365-14374. DOI: 10.1109/ACCESS.2023.3241005
    [9]
    Dalal N, Triggs B. Histograms of oriented gradients for human detection[C]//2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), 2005, 1: 886-893.
    [10]
    Felzenszwalb P, McAllester D, Ramanan D. A discriminatively trained, multiscale, deformable part model[C]//2008 IEEE Conference on Computer Vision and Pattern Recognition, 2008: 1-8.
    [11]
    Girshick R, Donahue J, Darrell T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2014: 580-587.
    [12]
    Girshick R. Fast R-CNN[C]//Proceedings of the IEEE International Conference on Computer Vision, 2015: 1440-1448.
    [13]
    REN Shaoqing, HE Kaiming, Ross Girshick, et al. Faster R-CNN: towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2016, 39(6): 1137-1149.
    [14]
    HE K, ZHANG X, REN S, et al. Spatial pyramid pooling in deep convolutional networks for visual recognition[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37(9): 1904-1916. DOI: 10.1109/TPAMI.2015.2389824
    [15]
    LIU W, Anguelov D, Erhan D, et al. Ssd: single shot multibox detector[C]//Computer Vision–ECCV 2016: 14th European Conference, 2016: 21-37.
    [16]
    FU C Y, LIU W, Ranga A, et al. Dssd: deconvolutional single shot detector[J]. arXiv preprint arXiv:1701.06659, 2017.
    [17]
    Jeong J, Park H, Kwak N. Enhancement of SSD by concatenating feature maps for object detection[J]. arXiv preprint arXiv:1705.09587, 2017.
    [18]
    LI Z, ZHOU F. FSSD: feature fusion single shot multibox detector[J]. arXiv preprint arXiv:1712.00960, 2017.
    [19]
    Redmon J, Divvala S, Girshick R, et al. You only look once: Unified, real-time object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016: 779-788.
    [20]
    Redmon J, Farhadi A. YOLO9000: better, faster, stronger [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017: 7263-7271.
    [21]
    Redmon J, Farhadi A. Yolov3: An incremental improvement[J]. arXiv preprint arXiv:1804.02767, 2018.
    [22]
    Bochkovskiy A, WANG C Y, LIAO H Y M. Yolov4: Optimal speed and accuracy of object detection[J]. arXiv preprint arXiv:2004.10934, 2020.
    [23]
    DING L, XU X, CAO Y, et al. Detection and tracking of infrared small target by jointly using SSD and pipeline filter[J]. Digital Signal Processing, 2021, 110: 102949. DOI: 10.1016/j.dsp.2020.102949
    [24]
    WEI J, SU S, ZHAO Z, et al. Infrared pedestrian detection using improved UNet and YOLO through sharing visible light domain information[J]. Measurement, 2023, 221: 113442. DOI: 10.1016/j.measurement.2023.113442
    [25]
    Terven Juan, Diana-Margarita Córdova-Esparza, et al. A comprehensive review of yolo architectures in computer vision: from yolov1 to yolov8 and yolo-nas[J]. Machine Learning and Knowledge Extraction, 2023, 5(4): 1680-1716. DOI: 10.3390/make5040083
    [26]
    HE K, ZHANG X, REN S, et al. Deep residual learning for image recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016: 770-778.
    [27]
    HOU Q, ZHOU D, FENG J. Coordinate attention for efficient mobile network design[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021: 13713-13722.
    [28]
    GAO S H, CHENG M M, ZHAO K, et al. Res2net: a new multi-scale backbone architecture[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2019, 43(2): 652-662.
    [29]
    袁志安, 谷雨, 马淦. 面向多类别舰船多目标跟踪的改进CSTrack算法[J]. 光电工程, 2023, 50(12): 16-31.

    YUAN Zhian, GU Yu, MA Gan. Improved CSTrack algorithm for multi-class ship multi-object tracking[J]. Opto-Electronic Engineering, 2023, 50(12): 16-31.
  • Related Articles

    [1]LYU Zongwang, NIU Hejie, SUN Fuyan, ZHEN Tong. Review of Research on Low-Light Image Enhancement Algorithms[J]. Infrared Technology , 2025, 47(2): 165-178.
    [2]WANG Zhen, LIU Lei. Infrared Image Enhancement for Power Equipment Based on Fusion Color Model Space[J]. Infrared Technology , 2024, 46(2): 225-232.
    [3]LIU Zhengnan, LIU Chunjing. Image Enhancement Algorithm Based on Texture Prior and Color Clustering[J]. Infrared Technology , 2023, 45(9): 932-940.
    [4]LIAN Cheng, ZHANG Baohui, JIANG Yunfeng, JIANG Zhifang, ZHANG Qian, YUAN Xilin. An Infrared Image Enhancement Method Based on Semantic Segmentation[J]. Infrared Technology , 2023, 45(4): 394-401.
    [5]YOU Dazhang, TAO Jiatao, ZHANG Yepeng, ZHANG Min. Low-light Image Enhancement Based on Gray Scale Transformation and Improved Retinex[J]. Infrared Technology , 2023, 45(2): 161-170.
    [6]MA Lu. Low-light Image Enhancement Based on Multi-scale Wavelet U-Net[J]. Infrared Technology , 2022, 44(4): 410-420.
    [7]CHEN Wenyi, YANG Chengxun, YANG Hui. Multiscale Retinex Infrared Image Enhancement Based on the Fusion of Guided Filtering and Logarithmic Transformation Algorithm[J]. Infrared Technology , 2022, 44(4): 397-403.
    [8]CHENG Tiedong, LU Xiaoliang, YI Qiwen, TAO Zhengliang, ZHANG Zhizhao. Research on Infrared Image Enhancement Method Combined with Single-scale Retinex and Guided Image Filter[J]. Infrared Technology , 2021, 43(11): 1081-1088.
    [9]ZHANG Pengcheng, HE Mingxia, CHEN Shuo, ZHANG Hongzhen, ZHANG Xinxin. Terahertz Image Enhancement Based on Generative Adversarial Network[J]. Infrared Technology , 2021, 43(4): 391-396.
    [10]WU Ling, CHEN Niannian, LIAO Xiaohua. Infrared Image Enhancement Based on Regional Adaptive Multiscale Intense Light Fusion[J]. Infrared Technology , 2020, 42(11): 1072-1076, 1080.
  • Cited by

    Periodical cited type(1)

    1. 赵兵,祖杰,王梓涵. 10kV配电线路安全运行的智能旁路开关技术研究. 电气自动化. 2025(03): 112-115 .

    Other cited types(7)

Catalog

    Article views (118) PDF downloads (34) Cited by(8)
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return