NeSSA: Near-Storage Data Selection for Accelerated Machine Learning Training.
Neha PrakriyaYu YangBaharan MirzasoleimanCho-Jui HsiehJason CongPublished in: HotStorage (2023)
Keyphrases
- machine learning
- data sets
- knowledge discovery
- data sources
- high quality
- data analysis
- database
- synthetic data
- data processing
- statistical analysis
- prior knowledge
- experimental data
- training examples
- raw data
- storage space
- training dataset
- storage systems
- knowledge acquisition
- labelled data
- databases
- small number
- image data
- end users
- decision trees
- input data
- sensor data
- learning algorithm
- storage devices
- feature selection
- data quality
- training data
- data structure
- missing data
- feature space
- relational databases
- training samples
- text classification