Efficient Hybrid Oversampling and Intelligent Undersampling for Imbalanced Big Data Classification.
Carla VairettiJosé Luis AssadiSebastián MaldonadoPublished in: CoRR (2023)
Keyphrases
- class imbalance
- big data
- class distribution
- active learning
- cost sensitive
- cloud computing
- imbalanced data
- machine learning
- high dimensionality
- data analysis
- predictive modeling
- minority class
- majority class
- big data analytics
- decision trees
- cost sensitive learning
- unstructured data
- sampling methods
- concept drift
- imbalanced datasets
- social media
- support vector machine
- high volume
- feature selection
- knowledge discovery
- information retrieval
- statistical learning
- business intelligence
- end users
- training data
- data warehousing
- decision support
- decision makers
- massive data
- massive datasets
- vast amounts of data
- data science
- e learning
- data driven decision making