Paper
21 December 2000 Layout and language: an efficient algorithm for detecting text blocks based on spatial and linguistic evidence
Author Affiliations +
Proceedings Volume 4307, Document Recognition and Retrieval VIII; (2000) https://doi.org/10.1117/12.410860
Event: Photonics West 2001 - Electronic Imaging, 2001, San Jose, CA, United States
Abstract
The ability to accurately detect those areas in plain text documents that consist of contiguous text is an important pre- process to many applications. This paper introduces a novel method that uses both spatial and linguistic knowledge in an accurate manner to provide an initial analysis of the document. This initial analysis may then be extended to provide a complete analysis of the text areas in the document.
© (2000) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Matthew Hurst "Layout and language: an efficient algorithm for detecting text blocks based on spatial and linguistic evidence", Proc. SPIE 4307, Document Recognition and Retrieval VIII, (21 December 2000); https://doi.org/10.1117/12.410860
Lens.org Logo
CITATIONS
Cited by 14 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Associative arrays

Head

Inspection

Detection and tracking algorithms

Data modeling

Error analysis

Radon

Back to Top