Imbalanced big data classification: a distributed implementation of SMOTE.
Avnish Kumar RastogiNitin NarangZamir Ahmad SiddiquiPublished in: ICDCN Workshops (2018)
Keyphrases
- big data
- imbalanced data sets
- class imbalance
- imbalanced datasets
- imbalanced data
- data intensive
- class imbalanced
- class distribution
- cloud computing
- data analysis
- cost sensitive learning
- minority class
- feature selection
- unstructured data
- predictive modeling
- text classification
- decision trees
- vast amounts of data
- social media
- machine learning
- data warehousing
- support vector machine
- commodity hardware
- big data analytics
- case study
- data science
- data processing
- data management
- training set
- cost sensitive
- sampling methods
- training dataset
- database
- business intelligence
- active learning
- data sets