An Efficient Word Segmentation Technique for Historical and Degraded Machine-Printed Documents.
Michael MakridisNikos A. NikolaouBasilios GatosPublished in: ICDAR (2007)
Keyphrases
- word segmentation
- printed documents
- document analysis
- handwritten document images
- language independent
- document images
- handwritten documents
- handwriting recognition
- word recognition
- n gram
- historical documents
- character recognition
- word level
- document processing
- optical character recognition
- image analysis
- cross language
- cross lingual
- document image analysis
- text classification
- hidden markov models
- text analysis
- natural language processing