Approx-SMOTE: Fast SMOTE for Big Data on Apache Spark.
Mario Juez-Gil, Álvar Arnaiz-González, Juan José Rodríguez, Carlos López Nozal, César García-Osorio
Published in: Neurocomputing (2021)
Keyphrases
- big data
- class distribution
- imbalanced data sets
- imbalanced data
- class imbalance
- class imbalanced
- cloud computing
- open source
- data management
- data analysis
- data processing
- big data analytics
- social media
- high volume
- unstructured data
- massive data
- business intelligence
- vast amounts of data
- data science
- knowledge discovery
- data warehousing
- data analytics
- decision making
- database