Digitizing Old Nubian Dictionary: Optical Character Recognition for Multi-Lingual and Multi-Script Text from Medieval Africa.
So MiyagawaPublished in: CiSt (2023)
Keyphrases
- multi lingual
- optical character recognition
- language identification
- document images
- ocr systems
- text recognition
- printed documents
- text lines
- language independent
- character recognition
- information access
- multiple information sources
- text extraction
- printed text
- text regions
- information retrieval
- cross lingual
- historical manuscripts
- handwriting recognition
- scanned documents
- document analysis
- domain dependent
- word spotting
- information retrieval systems
- machine learning
- handwritten documents
- character n grams
- keywords