Stochastic language models for style-directed layout analysis of document images.
Tapas Kanungo, Song Mao
Published in: IEEE Trans. Image Process. (2003)
Keyphrases
- document images
- language model
- language modeling
- n-gram
- document image analysis
- document retrieval
- language modelling
- speech recognition
- probabilistic model
- query expansion
- test collection
- statistical language models
- information retrieval
- document analysis
- retrieval model
- query terms
- smoothing methods
- optical character recognition
- language models for information retrieval
- relevance model
- word level
- page layout
- printed documents
- scanned documents
- cross-lingual
- printed text
- document ranking
- information extraction
- color images
- digital libraries