Paper
13 June 2024 D-CANet: diverse class-aware coding and decoding structure network for semantic segmentation of high-resolution remote sensing images
Zhengwu Yuan, Wen Shao, Qiang Chen, Yingqi Ke
Author Affiliations +
Proceedings Volume 13180, International Conference on Image, Signal Processing, and Pattern Recognition (ISPP 2024); 131801L (2024) https://doi.org/10.1117/12.3033644
Event: International Conference on Image, Signal Processing, and Pattern Recognition (ISPP 2024), 2024, Guangzhou, China
Abstract
The substantial scale variation and intra-class diversity within remote sensing imagery pose significant challenges for semantic segmentation, rendering methods developed for natural images inapplicable. These challenges, we introduce a novel semantic segmentation model named D-CANet, which primarily comprises three modules: the Global Class Center Awareness (GCCA), the Local Class Awareness Module (LCAM), and the Global Class Generation Module (GCG). Specifically, the GCCA module is dedicated to modeling the global representation of class context to mitigate the interference from image backgrounds; the LCAM module generates a local class representation, serving as an intermediary perceptual element that facilitates an implicit linkage between pixels and global class representations, minimizing the variance within classes; following the processing by the LCAM module, the GCG module enhances the global class representation. This encoder-decoder structure equipped with GCCA, LCAM, and GCG modules achieves precise segmentation of objects of varying scales within remote sensing imagery through the interactive perception and fusion of global and local features. Experimental assessments conducted on the Potsdam dataset and the Vaihingen dataset illustrate that D-CANet surpasses the current state-of-the-art semantic segmentation techniques in terms of efficacy.
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Zhengwu Yuan, Wen Shao, Qiang Chen, and Yingqi Ke "D-CANet: diverse class-aware coding and decoding structure network for semantic segmentation of high-resolution remote sensing images", Proc. SPIE 13180, International Conference on Image, Signal Processing, and Pattern Recognition (ISPP 2024), 131801L (13 June 2024); https://doi.org/10.1117/12.3033644
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Image segmentation

Semantics

Remote sensing

Modeling

Matrices

Data modeling

Design

Back to Top