The Effects of Random Undersampling with Simulated Class Imbalance for Big Data.
Tawfiq HasaninTaghi M. KhoshgoftaarPublished in: IRI (2018)
Keyphrases
- class imbalance
- big data
- random subspaces
- class distribution
- active learning
- cost sensitive
- cost sensitive learning
- majority class
- data analysis
- social media
- high dimensionality
- cloud computing
- data management
- data processing
- business intelligence
- sampling methods
- concept drift
- feature selection
- imbalanced datasets
- imbalanced data
- big data analytics
- data warehousing
- knowledge discovery
- minority class
- training data
- data warehouse
- end users
- data streams
- learning algorithm
- databases