An Unsupervised method for OCR Post-Correction and Spelling Normalisation for Finnish.
Quan DuongMika HämäläinenSimon HengchenPublished in: NoDaLiDa (2021)
Keyphrases
- high precision
- synthetic data
- data sets
- computational complexity
- detection method
- support vector machine svm
- classification accuracy
- error correction
- detection algorithm
- post processing
- supervised learning
- experimental evaluation
- hidden markov models
- prior knowledge
- feature space
- similarity measure
- image processing