Improved cost-sensitive representation of data for solving the imbalanced big data classification problem.
Mahboubeh FattahiMohammad Hossein MoattarYahya ForghaniPublished in: J. Big Data (2022)
Keyphrases
- big data
- cost sensitive
- vast amounts of data
- data analysis
- cloud computing
- cost sensitive classification
- cost sensitive learning
- training data
- class imbalance
- data sets
- data processing
- database
- support vector machine
- data sources
- misclassification costs
- multi class
- end users
- feature extraction
- text classification
- naive bayes
- knowledge discovery
- base classifiers
- machine learning
- big data analytics