Distributional Clustering of Words for Text Classification.
L. Douglas Baker, Andrew McCallum
Published in: SIGIR (1998)
Keyphrases
- distributional clustering
- text classification
- information theoretic
- text categorization
- bag of words
- naive bayes
- text mining
- feature selection
- multi label
- knn
- text documents
- n gram
- labeled data
- text classifiers
- machine learning
- semantic features
- prior knowledge
- sentiment classification
- sentiment analysis
- text data
- data cleaning
- decision trees
- machine translation
- document clustering
- term frequency
- mutual information
- k nearest neighbor
- training corpus
- training documents
- natural language