Fast words boundaries localization in text fields for low quality document images.
Dmitry Ilin, Dmitriy Novikov, Dmitry Polevoy, Dmitry P. Nikolaev
Published in: ICMV (2017)
Keyphrases
- low quality
- document images
- printed text
- printed documents
- text lines
- word level
- historical documents
- document analysis
- high quality
- word spotting
- handwritten documents
- Indian languages
- document processing
- document image analysis
- scanned documents
- page layout
- mathematical formulas
- text documents
- text regions
- scanned document images
- language identification
- word recognition
- optical character recognition
- handwriting recognition
- machine printed text
- poor quality
- text detection
- character recognition
- scanned images
- document layout
- page segmentation
- fingerprint images
- word segmentation
- n-gram
- keywords
- information retrieval
- document image retrieval
- text mining
- line extraction
- text processing