Keyphrases
- document clustering
- semi-supervised learning
- binary classification
- misclassification costs
- cost-sensitive
- semi-supervised
- unlabeled data
- text documents
- labeled data
- clustering method
- clustering algorithm
- document collections
- multi-class
- unsupervised learning
- text categorization
- text mining
- class distribution
- supervised learning
- machine learning
- target domain
- active learning
- cluster analysis
- training data
- naive Bayes
- support vector
- k-means
- generalization error
- text classification
- metric learning
- feature vectors
- class labels
- support vector machine
- training samples
- co-occurrence