Context-Sensitive Error Correction: Using Topic Models to Improve OCR.
Michael L. WickMichael G. RossErik G. Learned-MillerPublished in: ICDAR (2007)
Keyphrases
- error correction
- context sensitive
- topic models
- topic modeling
- latent dirichlet allocation
- error detection
- text documents
- natural language
- information retrieval
- latent variables
- text mining
- co occurrence
- latent topics
- language model
- probabilistic model
- probabilistic topic models
- latent topic models
- watermarking scheme
- data mining
- spelling correction