25 August 2022 Improved YOLOv5-S object detection method for optical remote sensing images based on contextual transformer
Qikai Zhou, Wei Zhang, Ruizhi Li, Jin Wang, Shuhui Zhen, Fu Niu
Author Affiliations +
Abstract

To address the problems of error and omission detection in remote sensing image detection caused by the diverse scale changes of remote sensing object scales and the abundant proportion of small-scale objects, as well as the global and dense distribution of remote sensing objects, a remote sensing image detection improvement method based on YOLOv5-S is proposed. First, according to the characteristics of remote sensing objects, the data enhancement strategy is adopted to expand the dataset samples for the characteristics of remote sensing objects to improve the generalization ability of the model. Second, the contextual transformer module is introduced to the backbone feature extraction network and the feature fusion network to ensure the local feature extraction capability while improving the global information acquisition capability of the model, making full use of the input contextual information and guiding the dynamic attention matrix learning to improve the visual representation ability. Third, based on the original model, a shallow detection scale is added, and then a multiscale complex fusion structure is adopted. Meanwhile, the K-means++ algorithm replaces the original K-means algorithm and then clusters 12 anchor box sizes. Fourth, the efficient intersection over union loss is used to improve the accuracy of the remote sensing object recognition prediction. In the experiment on the on two optical remote sensing image datasets, a comparison with several object detection algorithms based on convolutional neural network is made, the results show that the mAP@0.5 tested on the remote sensing datasets is higher than the original YOLOv5-S. Compared with other models, the detection efficiency is higher, and the problems of small-scale object detection in remote sensing image have been significantly improved.

© 2022 SPIE and IS&T
Qikai Zhou, Wei Zhang, Ruizhi Li, Jin Wang, Shuhui Zhen, and Fu Niu "Improved YOLOv5-S object detection method for optical remote sensing images based on contextual transformer," Journal of Electronic Imaging 31(4), 043049 (25 August 2022). https://doi.org/10.1117/1.JEI.31.4.043049
Received: 7 April 2022; Accepted: 4 August 2022; Published: 25 August 2022
Lens.org Logo
CITATIONS
Cited by 7 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Remote sensing

Detection and tracking algorithms

Commercial off the shelf technology

Feature extraction

Transformers

Visualization

Data modeling

Back to Top