Term Impacts as Normalized Term Frequencies for BM25 Similarity Scoring.
Vo Ngoc Anh, Raymond Wan, Alistair Moffat
Published in: SPIRE (2008)
Keyphrases
- term frequency
- retrieval model
- similarity measure
- text categorization
- tf idf
- term proximity
- term weighting
- text classification
- average precision
- bag of words
- okapi bm25
- document frequency
- term weights
- text documents
- document representation
- language model
- word frequency
- weighting scheme
- test collection
- information retrieval systems
- language modeling
- retrieval effectiveness
- semantic distance
- semantic similarity
- evaluation metrics
- document clustering
- information retrieval
- index terms
- document retrieval
- retrieval systems
- image representation
- nearest neighbor
- multiscale
- probabilistic retrieval model