Evaluating classifier performance with highly imbalanced Big Data.
John T. HancockTaghi M. KhoshgoftaarJustin M. JohnsonPublished in: J. Big Data (2023)
Keyphrases
- big data
- class distribution
- highly imbalanced
- cost sensitive
- class imbalance
- cloud computing
- data management
- misclassification costs
- big data analytics
- social media
- data processing
- business intelligence
- data analysis
- vast amounts of data
- data science
- knowledge discovery
- test set
- training set
- data sets
- data warehousing
- training data
- imbalanced data
- concept drift
- case study
- training examples
- training samples
- feature selection