Probabilistic Bounds for Data Storage with Feature Selection and Undersampling.
Ghurumuruhan Ganesan
Published in: CoRR (2023)
Keyphrases
- data storage
- feature selection
- data management
- database management systems
- storage and retrieval
- data integrity
- relational database systems
- storage media
- upper bound
- machine learning
- class imbalance
- aggregated data
- storage devices
- lower bound
- high dimensionality
- support vector machine
- feature selection algorithms
- knn
- feature subset
- column oriented
- b tree
- dimensionality reduction
- multi class
- feature space
- bayesian networks
- query processing
- multi dimensional
- databases
- flash memory
- feature set
- database