Indoor scene recognition method based on multi-scale feature attention mechanism

Yingnan Zhang; Jingwen Li; Jianwu Jiang

doi:10.1117/12.2684625

21 July 2023 Indoor scene recognition method based on multi-scale feature attention mechanism

Yingnan Zhang, Jingwen Li, Jianwu Jiang

Proceedings Volume 12717, 3rd International Conference on Artificial Intelligence, Automation, and High-Performance Computing (AIAHPC 2023); 1271739 (2023) https://doi.org/10.1117/12.2684625
Event: 3rd International Conference on Artificial Intelligence, Automation, and High-Performance Computing (AIAHPC 2023), 2023, Wuhan, China

Abstract

To address the problem of low accuracy of traditional scene recognition methods in indoor environment, an indoor scene recognition method combining multi-scale features and attention mechanism is proposed. The method uses Efficientnet-B3 as the backbone network, introduces channel and spatial attention modules to improve the refinement capability of the network for features, and designs a multi-scale feature fusion structure to enhance the adaptability of the network to scales, based on which the model parameters are optimized by adding a spatial pyramid, thus further improving the model calculation accuracy. The experimental analysis shows that the average accuracy of this model reaches 94.4% in nine types of indoor scenes, all of which are better than the calculation results of AlexNet, VGGNet16, GoogLeNet, ResNet34, EfficientNet and other models, providing a new way of thinking for indoor scene recognition.

Citation Download Citation

Yingnan Zhang, Jingwen Li, and Jianwu Jiang "Indoor scene recognition method based on multi-scale feature attention mechanism", Proc. SPIE 12717, 3rd International Conference on Artificial Intelligence, Automation, and High-Performance Computing (AIAHPC 2023), 1271739 (21 July 2023); https://doi.org/10.1117/12.2684625

ACCESS THE FULL ARTICLE

INSTITUTIONAL
Select your institution to access the SPIE Digital Library.

SELECT YOUR INSTITUTION

PERSONAL
Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.

PERSONAL SIGN IN

No SPIE Account? Create one

PURCHASE THIS CONTENT

SUBSCRIBE TO DIGITAL LIBRARY

50 downloads per 1-year subscription

Members: $195

Non-members: $335 ADD TO CART

25 downloads per 1 - year subscription

Members: $145

Non-members: $250 ADD TO CART

PURCHASE SINGLE ARTICLE

Includes PDF, HTML & Video, when available

Members: $17.00

Non-members: $21.00 ADD TO CART

PROCEEDINGS
8 PAGES

DOWNLOAD PAPER SAVE TO MY LIBRARY

GET CITATION

RIGHTS & PERMISSIONS

Get copyright permission Get copyright permission on Copyright Marketplace

KEYWORDS

Feature extraction

Data modeling

Image classification

Scene classification

Network architectures

Convolution

Education and training

Show All Keywords

Keywords/Phrases

Search In:

Publication Years