Improving text categorization bootstrapping via unsupervised learning.
Alfio Massimiliano GliozzoCarlo StrapparavaIdo DaganPublished in: ACM Trans. Speech Lang. Process. (2009)
Keyphrases
- text categorization
- unsupervised learning
- semi supervised learning
- text classification
- feature selection
- multi label
- supervised learning
- semi supervised
- unlabeled data
- knn
- k nearest neighbor
- naive bayes
- information gain
- semantic browsing
- text documents
- information extraction
- reuters corpus
- automatic text categorization
- text classifiers
- object recognition
- dimensionality reduction
- term frequency
- feature space
- document categorization
- model selection
- multi instance multi label learning
- automated text categorization
- feature selection for text categorization
- training data
- data sets
- term selection
- maximum likelihood