Modeling with Words: an Approach to Text Categorization.
James G. Shanahan
Published in: FUZZ-IEEE (2001)
Keyphrases
- text categorization
- text documents
- distributional clustering
- training documents
- text classification
- feature selection
- knn
- document frequency
- multi label
- information gain
- semi supervised learning
- automatic text categorization
- k nearest neighbor
- word frequency
- automated text categorization
- feature weighting
- text collections
- term weighting
- naive bayes
- unlabeled data
- learning process
- n gram
- reuters corpus
- nearest neighbor
- feature selections
- neural network
- high dimensional
- feature generation
- text classifiers