Irrelevant attributes and imbalanced classes in multi-label text-categorization domains.
Sareewan DendamrongvitPeerapon VateekulMiroslav KubatPublished in: Intell. Data Anal. (2011)
Keyphrases
- text categorization
- multi label
- multi label classification
- text classification
- image annotation
- transfer learning
- feature selection
- knn
- binary classification
- multi label learning
- hierarchical text categorization
- naive bayes
- information gain
- automatic text categorization
- k nearest neighbor
- text classifiers
- semi supervised learning
- real world
- databases
- machine learning
- multiple labels
- feature selections
- unlabeled data
- em algorithm
- graph cuts
- image classification
- nearest neighbor
- information extraction
- decision trees