Open Source Tesseract in Re-OCR of Finnish Fraktur from 19th and Early 20th Century Newspapers and Journals - Collected Notes on Quality Improvement.
Kimmo KettunenMika KoistinenPublished in: DHN (2019)
Keyphrases
- quality improvement
- open source
- quality assurance
- industrial processes
- preprocessing
- optical character recognition
- post processing
- open source software
- source code
- st century
- product quality
- quality control
- web pages
- document images
- character recognition
- case study
- news articles
- error correction
- real world
- image data
- digital libraries
- recognition errors