Subgroup discover in large size data sets preprocessed using stratified instance selection for increasing the presence of minority classes.
José Ramón CanoSalvador GarcíaFrancisco HerreraPublished in: Pattern Recognit. Lett. (2008)
Keyphrases
- instance selection
- data sets
- nearest neighbor
- feature and instance selection
- text classification
- data reduction
- multi class
- preprocessing
- semi supervised learning
- training set
- multiple instance learning
- knowledge discovery and data mining
- high dimensional data
- knowledge discovery
- support vector
- supervised learning
- classification accuracy
- classification algorithm
- data streams
- training data
- machine learning