An Instance Selection Algorithm for Big Data in High imbalanced datasets based on LSH.
Germán E. Melo-AcostaFreddy Duitama-MuñozJulián D. Arias-LondoñoPublished in: CoRR (2022)
Keyphrases
- selection algorithm
- big data
- imbalanced datasets
- feature selection algorithms
- cloud computing
- data analysis
- data management
- data processing
- cost sensitive learning
- social media
- business intelligence
- imbalanced data
- class imbalance
- search algorithm
- feature subset
- class distribution
- sampling methods
- knowledge discovery
- database management systems
- neural network
- cost sensitive
- nearest neighbor
- feature selection