HU Yan, YUAN Zihao, TU Xiaoguang, LIU Jianhua, LEI Xia, WANG Wenjing. Improved SSD Object Detection Algorithm Based on Contrastive Learning[J]. Infrared Technology, 2024, 46(5): 548-555.

Improved SSD Object Detection Algorithm Based on Contrastive Learning

  • Received Date: May 17, 2023
  • Revised Date: July 10, 2023
  • Available Online: May 23, 2024
  • Existing deep learning-based object detection algorithms still struggle with several conditions in images: diverse object viewpoints, object deformation, occlusion, illumination variation, and small objects. To address these issues, this paper introduces contrastive learning into the SSD object detection network and improves the original SSD algorithm. First, object image blocks and background image blocks are randomly cropped from the sample images and fed into a contrastive learning network for feature extraction and contrastive loss calculation. The SSD network is then trained with supervised learning: the contrastive loss is combined with the SSD loss as a weighted sum, and the result is fed back to optimize the network parameters. Introducing contrastive learning into the detection network improves the distinction between background and object in the feature space. As a result, the proposed algorithm significantly improves the accuracy of the SSD network for object detection and obtains satisfactory detection results on both visible and thermal infrared images. On the PASCAL VOC2012 dataset, the proposed algorithm increases the AP50 value by 0.3%; on the LLVIP dataset, the corresponding increase in AP50 is 0.2%.
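
The training signal described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact loss or sampling rule, so everything specific here is an assumption made for illustration: the InfoNCE-style form of the contrastive loss, the cosine similarity, the `temperature` and `weight` values, the zero-overlap rule for background crops, and all function names (`sample_background_box`, `contrastive_loss`, `total_loss`). Feature vectors are plain Python lists standing in for the output of the contrastive learning network.

```python
import math
import random


def iou(a, b):
    """Intersection-over-union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def sample_background_box(img_w, img_h, gt_boxes, size, rng, max_tries=100):
    """Randomly pick a size x size box that overlaps no ground-truth box.

    Assumption: a background crop must have zero overlap with every object.
    Returns None if no such box is found within max_tries attempts.
    """
    for _ in range(max_tries):
        x = rng.randint(0, img_w - size)
        y = rng.randint(0, img_h - size)
        box = (x, y, x + size, y + size)
        if all(iou(box, gt) == 0.0 for gt in gt_boxes):
            return box
    return None


def cosine(u, v):
    """Cosine similarity between two feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v + 1e-12)


def contrastive_loss(obj_feats, bg_feats, temperature=0.5):
    """InfoNCE-style loss (assumed form, not taken from the paper).

    Each object embedding is an anchor; the other object embeddings are its
    positives and the background embeddings are its negatives.
    """
    if len(obj_feats) < 2:
        raise ValueError("need at least two object embeddings")
    losses = []
    for i, anchor in enumerate(obj_feats):
        pos = sum(math.exp(cosine(anchor, f) / temperature)
                  for j, f in enumerate(obj_feats) if j != i)
        neg = sum(math.exp(cosine(anchor, f) / temperature) for f in bg_feats)
        losses.append(-math.log(pos / (pos + neg)))
    return sum(losses) / len(losses)


def total_loss(ssd_loss, con_loss, weight=0.1):
    """Weighted sum of the SSD loss and the contrastive loss.

    The weight value is a hypothetical hyperparameter.
    """
    return ssd_loss + weight * con_loss


if __name__ == "__main__":
    rng = random.Random(0)
    gt_boxes = [(0, 0, 50, 50)]          # one hypothetical ground-truth box
    print("background crop:", sample_background_box(100, 100, gt_boxes, 20, rng))

    obj = [[1.0, 0.0], [0.9, 0.1]]       # toy object embeddings
    bg = [[0.0, 1.0], [0.1, 0.9]]        # toy background embeddings
    con = contrastive_loss(obj, bg)
    print("combined loss:", total_loss(ssd_loss=1.0, con_loss=con))
```

In this sketch the contrastive term is small when object embeddings cluster together and sit far from background embeddings, which is the separation in feature space that the abstract credits for the accuracy gain. In a real training loop the same weighted sum would be computed on differentiable tensors so that gradients reach the detector's parameters.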

