OCR4all - An Open-Source Tool Providing a (Semi-)Automatic OCR Workflow for Historical Printings.
Christian ReulDennis ChristAlexander HarteltNico BalbachMaximilian WehnerUwe SpringmannChristoph WickChristine GrundigAndreas BüttnerFrank PuppePublished in: CoRR (2019)
Keyphrases
- semi automatic
- optical character recognition
- open source
- fully automatic
- recognition errors
- design rationale
- character recognition
- document images
- post processing
- gold standard
- domain ontology
- text recognition
- semi automatically
- ontology mapping
- error correction
- preprocessing
- source code
- semantic annotation
- document analysis
- handwriting recognition
- ground truth
- ontology development
- scanned documents
- case study
- decision making
- knowledge extraction
- manual annotation
- printed documents
- web services