Analysis of Data Preprocessing Increasing the Oversampling Ratio for Extremely Imbalanced Big Data Classification.
Sara del RíoJosé Manuel BenítezFrancisco HerreraPublished in: TrustCom/BigDataSE/ISPA (2) (2015)
Keyphrases
- data preprocessing
- big data
- preprocessing
- data analysis
- class imbalance
- feature selection
- data mining
- preprocessing step
- decision trees
- support vector
- data management
- cloud computing
- data warehousing
- machine learning
- classification accuracy
- database management systems
- knowledge management
- training set
- web usage mining
- minority class
- feature extraction