Applying Probabilistic Term Weighting to OCR Text in the Case of a Large Alphabetic Library Catalogue.
Elke MittendorfPeter SchäubleParaic SheridanPublished in: SIGIR (1995)
Keyphrases
- term weighting
- text retrieval
- information retrieval
- tf idf
- optical character recognition
- text categorization
- text documents
- term frequency
- retrieval systems
- inverse document frequency
- language modeling
- document images
- web documents
- text mining
- semantic information
- document retrieval
- retrieval model
- generative model
- vector space model
- text classification
- co occurrence
- digital libraries
- keywords