An Empirical Study for Class Imbalance in Extreme Multi-label Text Classification.
Sangwoo HanChan LimBonggeon ChaJongwuk LeePublished in: BigComp (2021)
Keyphrases
- multi label
- class imbalance
- text classification
- cost sensitive
- feature selection
- text categorization
- class distribution
- multi label classification
- binary classification
- active learning
- machine learning
- naive bayes
- multi instance
- sampling methods
- text mining
- high dimensionality
- imbalanced datasets
- random forest
- unlabeled data
- learning tasks
- image classification
- labeled data
- knn
- k nearest neighbor
- concept drift
- class labels
- minority class
- data sets
- multiple labels
- graph cuts
- feature space
- multi class
- learning environment
- image features
- training examples
- semi supervised
- natural language processing