Semi-supervised Text Categorization by Considering Sufficiency and Diversity.
Shoushan Li, Sophia Yat Mei Lee, Wei Gao, Chu-Ren Huang
Published in: NLPCC (2013)
Keyphrases
- text categorization
- semi supervised
- semi supervised learning
- unlabeled data
- labeled data
- multi label
- text classification
- supervised learning
- feature selection
- active learning
- information gain
- knn
- text documents
- pairwise
- k nearest neighbor
- unsupervised learning
- naive bayes
- feature weighting
- text classifiers
- automated text categorization
- reuters corpus
- document categorization
- semantic browsing
- text collections
- term frequency
- tf idf
- training data
- nearest neighbor
- term weighting
- data points
- machine learning
- training set