Term Frequency Normalization via Pareto Distributions.
Gianni AmatiC. J. van RijsbergenPublished in: ECIR (2002)
Keyphrases
- term frequency
- text categorization
- tf idf
- text classification
- retrieval model
- bag of words
- average precision
- text documents
- document frequency
- term weighting
- document length normalization
- inverse document frequency
- query terms
- information retrieval
- document representation
- image classification
- bayesian networks
- vector space model
- active learning
- language model
- image content
- knowledge discovery
- retrieval systems
- knn
- text mining
- feature selection
- computer vision
- information retrieval systems
- machine learning