Projecting Away the Class Imbalance Problem in Author Attribution.
Grant GehrkeCraig H. MartellAndrew I. ScheinPranav AnandPublished in: Int. J. Semantic Comput. (2009)
Keyphrases
- class imbalance
- class distribution
- active learning
- cost sensitive
- writing style
- cost sensitive learning
- majority class
- software defect prediction
- imbalanced datasets
- sampling methods
- concept drift
- feature selection
- high dimensionality
- small disjuncts
- random subspaces
- minority class
- imbalanced data
- text mining
- ensemble learning
- kernel function
- training set
- pattern recognition
- learning algorithm