Paper
7 March 2024 A coarse-to-fine multi-view stereo network based on attention mechanisms
YaNan Xu, Zhiyong Zuo, Zhenbao Luo, Yuyong Cui, Zhongjian Wu
Author Affiliations +
Proceedings Volume 13086, MIPPR 2023: Pattern Recognition and Computer Vision; 130860M (2024) https://doi.org/10.1117/12.3004936
Event: Twelfth International Symposium on Multispectral Image Processing and Pattern Recognition (MIPPR2023), 2023, Wuhan, China
Abstract
Multi-view stereo is a method that analyzes and processes images from multiple perspectives to estimate the 3D geometric information of the scene to achieve 3D reconstruction. To improve the accuracy of 3D reconstruction in large-scale scenes and reduce the complexity of the reconstruction algorithm, in this paper, we propose a coarse-to-fine multi-view stereo network based on attention mechanism. First, we use a feature pyramid to extract multi-scale features, introducing richer geometric information and more contextual information at different levels of the pyramid to improve modeling accuracy. Then, we use position encoding on the coarse-scale feature map and introduce an attention mechanism to obtain more context information. We adopt a cascade structure to achieve high-resolution depth map construction. We use the reference image to refine the final result again and enhance details such as edges. We conduct experiments on the publicly available DTU dataset. Experimental results show that our proposed method improves accuracy compared with existing algorithms. In addition, we also conduct experiments on other representative public datasets. The accuracy of the experimental results further validates the effectiveness of our proposed method.
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
YaNan Xu, Zhiyong Zuo, Zhenbao Luo, Yuyong Cui, and Zhongjian Wu "A coarse-to-fine multi-view stereo network based on attention mechanisms", Proc. SPIE 13086, MIPPR 2023: Pattern Recognition and Computer Vision, 130860M (7 March 2024); https://doi.org/10.1117/12.3004936
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
3D modeling

Neural networks

Modeling

Reconstruction algorithms

Back to Top