OCR-D: An end-to-end open source OCR framework for historical printed documents.
Clemens NeudeckerKonstantin BaiererMaria FederbuschMatthias BoenigKay-Michael WürznerVolker HartmannElisa HerrmannPublished in: DATeCH (2019)
Keyphrases
- end to end
- printed documents
- optical character recognition
- document images
- open source
- character recognition
- document analysis
- text localization and recognition
- document processing
- character segmentation
- language independent
- congestion control
- document image analysis
- handwriting recognition
- error correction
- web services
- machine learning
- real time
- digital libraries