Keyphrases
- class imbalance
- machine learning
- active learning
- cost-sensitive learning
- class distribution
- feature selection
- cost-sensitive
- concept drift
- sampling methods
- majority class
- machine learning algorithms
- small disjuncts
- software defect prediction
- natural language processing
- imbalanced datasets
- information extraction
- high dimensionality
- machine learning methods
- text classification
- support vector machine
- data analysis
- supervised learning
- pattern recognition
- decision trees
- learning models
- k-nearest neighbor
- text mining
- minority class
- imbalanced data
- learning algorithm