An Improved Boosting Algorithm and its Application to Text Categorization.
Fabrizio SebastianiAlessandro SperdutiNicola ValdambriniPublished in: CIKM (2000)
Keyphrases
- text categorization
- hierarchical text categorization
- feature selection
- text classification
- multi label
- knn
- reuters corpus
- naive bayes
- k nearest neighbor
- information gain
- automated text categorization
- feature weighting
- text documents
- automatic text categorization
- base classifiers
- semi supervised learning
- text collections
- document categorization
- document frequency
- ensemble methods
- mutual information
- learning algorithm
- information retrieval
- data sets
- semantic browsing
- term selection
- text classifiers
- term frequency
- unlabeled data
- feature set
- machine learning