A Combination of Resampling and Ensemble Method for Text Classification on Imbalanced Data.
Haijun Feng, Wen Qin, Huijing Wang, Yi Li, Guangwu Hu
Published in: BigData Congress (2021)
Keyphrases
- ensemble methods
- imbalanced data
- text classification
- prediction accuracy
- decision trees
- ensemble learning
- machine learning methods
- benchmark datasets
- random forests
- base classifiers
- random forest
- generalization ability
- feature selection
- naive bayes
- text categorization
- machine learning
- ensemble classifier
- labeled data
- text mining
- multi label
- base learners
- k nearest neighbor
- neural network
- training data