Paper
21 July 2023 Indoor scene recognition method based on multi-scale feature attention mechanism
Yingnan Zhang, Jingwen Li, Jianwu Jiang
Author Affiliations +
Proceedings Volume 12717, 3rd International Conference on Artificial Intelligence, Automation, and High-Performance Computing (AIAHPC 2023); 1271739 (2023) https://doi.org/10.1117/12.2684625
Event: 3rd International Conference on Artificial Intelligence, Automation, and High-Performance Computing (AIAHPC 2023), 2023, Wuhan, China
Abstract
To address the problem of low accuracy of traditional scene recognition methods in indoor environment, an indoor scene recognition method combining multi-scale features and attention mechanism is proposed. The method uses Efficientnet-B3 as the backbone network, introduces channel and spatial attention modules to improve the refinement capability of the network for features, and designs a multi-scale feature fusion structure to enhance the adaptability of the network to scales, based on which the model parameters are optimized by adding a spatial pyramid, thus further improving the model calculation accuracy. The experimental analysis shows that the average accuracy of this model reaches 94.4% in nine types of indoor scenes, all of which are better than the calculation results of AlexNet, VGGNet16, GoogLeNet, ResNet34, EfficientNet and other models, providing a new way of thinking for indoor scene recognition.
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Yingnan Zhang, Jingwen Li, and Jianwu Jiang "Indoor scene recognition method based on multi-scale feature attention mechanism", Proc. SPIE 12717, 3rd International Conference on Artificial Intelligence, Automation, and High-Performance Computing (AIAHPC 2023), 1271739 (21 July 2023); https://doi.org/10.1117/12.2684625
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Feature extraction

Data modeling

Image classification

Scene classification

Network architectures

Convolution

Education and training

Back to Top