FAST: a ROC-based feature selection metric for small samples and imbalanced data classification problems.
Xue-wen Chen, Michael Wasikowski
Published in: KDD (2008)
Keyphrases
- small samples
- imbalanced data
- feature selection
- support vector machine
- model selection
- classification models
- text categorization
- classification accuracy
- multi-class
- text classification
- high dimensionality
- linear regression
- feature selection algorithms
- feature set
- class imbalance
- machine learning
- accurate models
- decision trees
- feature extraction
- support vector
- dimensionality reduction
- feature space
- ensemble methods
- ROC curve
- data sets
- regression trees
- kNN
- feature subset
- sample size
- class distribution
- ensemble classifier