Keyphrases
- author identification
- class imbalance
- highly skewed
- class distribution
- imbalanced datasets
- active learning
- cost-sensitive
- majority class
- cost-sensitive learning
- high dimensionality
- minority class
- concept drift
- misclassification costs
- feature selection
- imbalanced data
- training data
- sampling methods
- rare events
- imbalanced data sets
- training set
- text classification
- benchmark datasets
- prior knowledge