Profiling of OCR'ed Historical Texts Revisited.
Florian FinkKlaus U. SchulzUwe SpringmannPublished in: CoRR (2017)
Keyphrases
- optical character recognition
- post processing
- historical manuscripts
- preprocessing
- document images
- character recognition
- historical data
- error correction
- natural language text
- natural language generation
- legal texts
- document image analysis
- domain dependent
- document processing
- recognition errors
- website
- historical information
- syntactic structures
- text documents
- page layout
- chinese texts
- natural language