Efficient automatic OCR word validation using word partial format derivation and language model.
Siyuan ChenDharitri MisraGeorge R. ThomaPublished in: DRR (2010)
Keyphrases
- language model
- n gram
- translation model
- language modeling
- word clouds
- statistical machine translation
- word error rate
- context sensitive
- probabilistic model
- multiword
- speech recognition
- information retrieval
- language modelling
- word segmentation
- document retrieval
- co occurrence
- query expansion
- retrieval model
- statistical language modeling
- term weighting
- relevance model
- automatic speech recognition
- optical character recognition
- language independent
- statistical language models
- improve retrieval effectiveness
- ad hoc information retrieval
- query terms