ZHANG Fei, WANG Jian, ZHANG Yuesong. Lightweight Infrared Small Target Detection Algorithm under Oblique View Based on YOLOv5[J]. Infrared Technology , 2025, 47(2): 217-225.
Citation: ZHANG Fei, WANG Jian, ZHANG Yuesong. Lightweight Infrared Small Target Detection Algorithm under Oblique View Based on YOLOv5[J]. Infrared Technology , 2025, 47(2): 217-225.

Lightweight Infrared Small Target Detection Algorithm under Oblique View Based on YOLOv5

More Information
  • Received Date: May 13, 2023
  • Revised Date: May 31, 2023
  • To address the challenge of fast and accurate detection of small infrared pedestrian targets at inclined viewing angles, a lightweight real-time detection network model for small infrared pedestrian targets (DRA-YOLO) was proposed. First, K-means++ anchor box clustering was utilized to adapt to targets of different size scales, thereby accelerating network convergence and improving detection accuracy. Second, different attention mechanisms were integrated into the redesigned feature extraction network to enhance feature location and computational efficiency. This was coupled with an improved feature pyramid structure to extract key features and enhance model stability. Finally, the neck was redesigned by eliminating down-sampling and reorganizing it with SimAM to form a new feature fusion structure. Moreover, the detection head was redesigned to suit the dataset used in this study. Comparative experiments showed that, relative to the original YOLOv5s model, the proposed method performed excellently on both self-made and public datasets. The mAP50 reached 94.5%, detection speed improved by 20.8%, model size was compressed to 10.1 MB (30.3% reduction), and GFLOPs decreased by 29.1%. These improvements facilitated the accurate and rapid detection of targets, effectively balancing model size, detection accuracy, and inference speed.

  • [1]
    张伟, 庄幸涛, 王雪力, 等. DS-YOLO: 一种部署在无人机终端上的小目标实时检测算法[J]. 南京邮电大学学报(自然科学版), 2021, 41(1): 86-98.

    ZHANG Wei, ZHUANG Xingtao, WANG Xueli, et al. DS-YOLO: a real-time small object detection algorithm deployed on UAV terminals [J]. Journal of Nanjing University of Posts and Telecommunications (Natural Science Edition), 2021, 41(1): 86-98.
    [2]
    彭嘉雄, 周文琳. 红外背景抑制与小目标分割检测[J]. 电子学报, 1999, 27(12): 47-51. DOI: 10.3321/j.issn:0372-2112.1999.12.013

    PENG Jiaxiong, ZHOU Wenlin. Infrared background suppression for segmenting and detecting small target[J]. Acta Electronica Sinica, 1999, 27(12): 47-51. DOI: 10.3321/j.issn:0372-2112.1999.12.013
    [3]
    李刚, 邱尚斌, 林凌, 等. 基于背景差法和帧间差法的运动目标检测方法[J]. 仪器仪表学报, 2006, 27(8): 961-964.

    LI Gang, QIU Shangbin, LI Ling. New moving target detection method based on background differencing and coterminous frames differencing[J]. Chinese Journal of Scientific Instrument, 2006, 27(8): 961-964.
    [4]
    HE K, Gkioxari G, Dollár P, et al. Mask R-CNN[C]//Proceedings of the IEEE International Conference on Computer Vision, 2017: 2961-2969.
    [5]
    Girshick R. Fast R-CNN[C]//Proceedings of the IEEE International Conference on Computer Vision, 2015: 1440-1448.
    [6]
    REN S, HE K, Girshick R, et al. Faster R-CNN: towards real-time object detection with region proposal networks[J]. Advances in Neural Information Processing Systems, 2015, 28(1): 91-99.
    [7]
    LIU W, Anguelov D, Erhan D, et al. SSD: single shot multibox detector[C]//Computer VisionECCV 2016: 14th European Conference, Amsterdam, 2016: 21-37.
    [8]
    Redmon J, Divvala S, Girshick R, et al. You only look once: unified, real-time object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016: 779-788.
    [9]
    Redmon J, Farhadi A. YOLO9000: better, faster, stronger[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017: 7263-7271.
    [10]
    SUN M, ZHANG H, HUANG Z, et al. Road infrared target detection with Ⅰ‐YOLO[J]. IET Image Processing, 2022, 16(1): 92-101. DOI: 10.1049/ipr2.12331
    [11]
    SONG X, GAO S, CHEN C. A multispectral feature fusion network for robust pedestrian detection[J]. Alexandria Engineering Journal, 2021, 60(1): 73-85. DOI: 10.1016/j.aej.2020.05.035
    [12]
    张印辉, 张朋程, 何自芬, 等. 红外行人目标精细尺度嵌入轻量化实时检测[J]. 光子学报, 2022, 51(9): 266-276.

    ZHANG Yinhui, ZHANG Pengcheng, HE Zifen, et al. Lightweight real-time detection model of infrared pedestrian embedded in fine-scale[J]. Acta Photonica Sinica, 2022, 51(9): 266-276.
    [13]
    TAN S, YAN J, JIANG Z, et al. Approach for improving YOLOv5 network with application to remote sensing target detection[J]. Journal of Applied Remote Sensing, 2021, 15(3): 036512-036512.
    [14]
    Ghiasi G, LIN T Y, LE Q V. Nas-fpn: Learning scalable feature pyramid architecture for object detection[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019: 7036-7045.
    [15]
    LIU S, QI L, QIN H, et al. Path aggregation network for instance segmentation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018: 8759-8768.
    [16]
    CHEN J, KAO S, HE H, et al. Run, don't walk: chasing higher FLOPS for faster neural networks[J]. arXiv preprint arXiv: 2303.03667, 2023.
    [17]
    HOU Q, ZHOU D, FENG J. Coordinate attention for efficient mobile network design[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021: 13713-13722.
    [18]
    YANG L, ZHANG R Y, LI L, et al. Simam: a simple, parameter-free attention module for convolutional neural networks[C]//International Conference on Machine Learning. PMLR, 2021: 11863-11874.
    [19]
    JIA X, ZHU C, LI M, et al. LLVIP: A visible-infrared paired dataset for low-light vision[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021: 3496-3504.
  • Related Articles

    [1]ZHANG Fei, WANG Jian, ZHANG Yuesong. Lightweight Infrared Small Target Detection Algorithm under Oblique View Based on YOLOv5[J]. Infrared Technology , 2025, 47(2): 217-225.
    [2]ZHANG Xingping, SHAO Yanhua, MEI Yanying, ZHANG Xiaoqiang, CHU Hongyu. Aerial Infrared Pedestrian Detection with Saliency Map Fusion[J]. Infrared Technology , 2024, 46(9): 1043-1050.
    [3]CHEN Sijing, FU Zhitao, LI Ziqian, NIE Han, SONG Jiawen. A Visible and Infrared Image Fusion Algorithm Based on Adaptive Enhancement and Saliency Detection[J]. Infrared Technology , 2023, 45(9): 907-914.
    [4]LONG Zhiliang, DENG Yueming, WANG Runmin, DONG Jun. Infrared and Visible Image Fusion Based on Saliency Detection and Latent Low-Rank Representation[J]. Infrared Technology , 2023, 45(7): 705-713.
    [5]LI Shiji, LI Zhongmin, LI Wei. Pedestrian Detection Method Based on Improved ViBe and YOLO v3 Algorithms[J]. Infrared Technology , 2023, 45(2): 137-142.
    [6]LAI Benlin, ZHANG Yongan, ZHANG Yaping, TIAN Li, ZHAO Danlu, HUANG Junhao, WANG Gang, ZHAO Hang. Research on Stress Field Detection of Building Blasting Environment Based on Infrared Digital Holography[J]. Infrared Technology , 2023, 45(1): 102-109.
    [7]YE Kuntao, LI Wen, SHU Leilei, LI Sheng. Infrared and Visible Image Fusion Method Based on Improved Saliency Detection and Non-subsampled Shearlet Transform[J]. Infrared Technology , 2021, 43(12): 1212-1221.
    [8]ZHAO Shuang, CHEN Shuyue, WANG Qiaoyue. Infrared Pedestrian Detection in Complex Night Scenes[J]. Infrared Technology , 2021, 43(6): 575-582.
    [9]CHE Kai, XIANG Zhengtao, CHEN Yufeng, LYU Jian, ZHOU Yun. Research on Infrared Image Pedestrian Detection Based on Improved Fast R-CNN[J]. Infrared Technology , 2018, 40(6): 578-584.
    [10]WEI Li, DING Meng, ZENG Lijun. Pedestrian Detection Based on Objectness and Sparse Coding in a Single Infrared Image[J]. Infrared Technology , 2016, 38(9): 752-757.
  • Cited by

    Periodical cited type(1)

    1. 相敏月,涂振宇,孙逸飞,方强,马飞. 基于ECA和BIFPN的低照度环境下的行人目标检测算法. 智能计算机与应用. 2023(09): 189-193 .

    Other cited types(2)

Catalog

    Article views (63) PDF downloads (31) Cited by(3)
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return