Paper
7 August 2024 DocPointNet: 3D text line detection method for point cloud
Fuping Su
Author Affiliations +
Proceedings Volume 13229, Seventh International Conference on Advanced Electronic Materials, Computers, and Software Engineering (AEMCSE 2024); 1322932 (2024) https://doi.org/10.1117/12.3037943
Event: Seventh International Conference on Advanced Electronic Materials, Computers, and Software Engineering (AEMCSE 2024), 2024, Nanchang, China
Abstract
With the advancement of the digitalisation wave, the importance of 3D point cloud data is becoming more and more prominent. In response to the current lack of 3D point cloud text line datasets, this paper constructs the DocPointCloud dataset, which is generated by 3D reconstruction of document images and provides rich data resources for 3D text line detection. At the same time, for the problem that existing approaches cannot accurately detect 3D text lines when processing document point clouds, this paper proposes the DocPointNet model. By introducing the feature pre-extraction module and the attention mechanism, this model significantly enhances the feature representation ability of the model on point cloud data, thus achieving the accurate detection of text lines in point cloud. Experimental results show that the mIoU of the model on the DocPointCloud dataset is 0.72, which verifies its effectiveness in the 3D point cloud text line detection task.
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Fuping Su "DocPointNet: 3D text line detection method for point cloud", Proc. SPIE 13229, Seventh International Conference on Advanced Electronic Materials, Computers, and Software Engineering (AEMCSE 2024), 1322932 (7 August 2024); https://doi.org/10.1117/12.3037943
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Point clouds

3D modeling

Visual process modeling

Feature extraction

3D image processing

Ablation

3D vision

Back to Top