Predicting student failure at school using genetic programming and different data mining approaches with high dimensional and imbalanced data.
Carlos Márquez-VeraAlberto CanoCristóbal RomeroSebastián VenturaPublished in: Appl. Intell. (2013)
Keyphrases
- data mining approaches
- imbalanced data
- high dimensional
- high dimensionality
- data mining
- rare events
- data mining techniques
- class imbalance
- class distribution
- linear regression
- low dimensional
- ensemble methods
- sampling methods
- random forest
- feature selection
- decision trees
- high dimensional data
- learning process
- support vector machine
- data points
- classification models
- minority class
- dimensionality reduction
- svm classifier
- nearest neighbor
- model selection
- kernel function
- training samples
- text classification
- knowledge discovery
- association rules
- training data
- learning algorithm