Using Bins to Empirically Estimate Term Weights for Text Categorization.
Carl L. SableKenneth Ward ChurchPublished in: EMNLP (2001)
Keyphrases
- data analysis
- text categorization
- term weights
- term weighting
- term frequency
- tf idf
- text classification
- knn
- feature selection
- k nearest neighbor
- text documents
- data mining
- semi supervised learning
- ranking functions
- co occurrence
- weighting schemes
- retrieval model
- retrieval effectiveness
- natural language
- supervised learning
- nearest neighbor
- high dimensional