Data reduction techniques for highly imbalanced medicare Big Data.
John T. HancockHuanjing WangTaghi M. KhoshgoftaarQianxin LiangPublished in: J. Big Data (2024)
Keyphrases
- big data
- data reduction
- highly imbalanced
- data analysis
- class distribution
- knowledge discovery
- data sets
- imbalanced data
- cost sensitive
- data compression
- cloud computing
- preprocessing
- data processing
- feature selection
- business intelligence
- data mining
- data management
- training data
- classification accuracy
- training set
- rough set theory
- big data analytics
- data warehousing
- classification rules
- singular value decomposition
- high dimensionality
- model selection
- dimensionality reduction
- test set
- social media
- databases
- database management systems
- real world
- clustering algorithm
- expert systems
- query processing
- data warehouse
- supervised learning
- genetic programming
- high dimensional data
- training samples