WANG You, HAN Lixiang, FU Gui. Aerial Infrared Image Target Recognition Method Based on Improved YOLOv5s[J]. Infrared Technology, 2024, 46(7): 775-781, 801.

Aerial Infrared Image Target Recognition Method Based on Improved YOLOv5s

More Information
  • Received Date: April 16, 2023
  • Revised Date: May 16, 2023
  • Available Online: July 24, 2024
  • Abstract: To improve UAV target recognition in dark conditions and to reduce missed detections and latency in complex environments and road conditions, this study proposes YOLOv5s-GN-CB, an improved YOLOv5s method for recognizing vehicles, people, and other targets in UAV infrared aerial images. The method modifies YOLOv5s in three ways: 1) introducing the Ghost module into the YOLOv5s backbone network and incorporating the NWD loss into Ghost; 2) adding the coordinate attention (CA) mechanism; 3) incorporating a weighted bidirectional feature pyramid network (BiFPN). On the InfiRay infrared aerial photography man-vehicle detection dataset, the improved YOLOv5s-GN-CB model achieves a mean average precision (mAP@0.5) of 95.1% at 75.188 frames per second (FPS). Compared with the original YOLOv5 model, mAP@0.5 and FPS improve by 4.2% and 12.02%, respectively. In the same scenario, the detection accuracy of target recognition in UAV infrared aerial images is significantly higher and the detection latency is lower.
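
The abstract names an NWD loss without defining it. NWD is commonly read as normalized Wasserstein distance, a similarity measure that models each bounding box as a 2D Gaussian and stays informative for small targets even when two boxes do not overlap. The following is a minimal sketch of that general idea, not the authors' implementation: the function names, the `(cx, cy, w, h)` box layout, and the normalizing constant `c=12.8` are assumptions chosen for illustration.

```python
import math


def nwd(box_a, box_b, c=12.8):
    """Normalized Wasserstein distance between two boxes.

    Each box is (cx, cy, w, h) and is modeled as a 2D Gaussian with
    mean (cx, cy) and covariance diag(w^2 / 4, h^2 / 4).
    `c` is a dataset-dependent scale; 12.8 is only a placeholder here.
    """
    cxa, cya, wa, ha = box_a
    cxb, cyb, wb, hb = box_b
    # Squared 2-Wasserstein distance between the two Gaussians.
    w2_squared = (
        (cxa - cxb) ** 2
        + (cya - cyb) ** 2
        + ((wa - wb) / 2.0) ** 2
        + ((ha - hb) / 2.0) ** 2
    )
    # Map the distance into (0, 1]; identical boxes give exactly 1.
    return math.exp(-math.sqrt(w2_squared) / c)


def nwd_loss(pred_box, target_box, c=12.8):
    """Regression loss: 0 for a perfect match, approaching 1 as boxes diverge."""
    return 1.0 - nwd(pred_box, target_box, c)


if __name__ == "__main__":
    target = (10.0, 10.0, 4.0, 4.0)
    shifted = (13.0, 14.0, 4.0, 4.0)  # same size, centre moved by 5 pixels
    print(nwd_loss(target, target))
    print(nwd_loss(shifted, target))
```

Unlike an IoU-based loss, this value still changes smoothly when a small predicted box has zero overlap with its target, which is the usual motivation for using it on small aerial targets. How the paper combines it with the Ghost module is not stated in the abstract.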

