Improving document clustering using Okapi BM25 feature weighting.
John S. WhissellCharles L. A. ClarkePublished in: Inf. Retr. (2011)
Keyphrases
- feature weighting
- document clustering
- tf idf
- term frequency
- vector space model
- text documents
- text mining
- document representation
- clustering algorithm
- text categorization
- clustering method
- document collections
- information retrieval
- k means
- cluster analysis
- machine learning
- natural language processing
- data analysis
- feature selection