When Tesseract Meets PERO: Open-Source Optical Character Recognition of Medieval Texts.
Vít NovotnýAles HorákPublished in: RASLAN (2022)
Keyphrases
- optical character recognition
- open source
- character recognition
- text recognition
- document images
- ocr systems
- character segmentation
- source code
- natural language generation
- image binarization
- page segmentation
- case study
- natural language
- handwriting recognition
- printed documents
- computer vision
- text segmentation
- machine vision
- text documents
- comparative evaluation
- text extraction
- image analysis
- machine learning