Data-Efficient Performance Modeling for Configurable Big Data Frameworks by Reducing Information Overlap Between Training Examples.
Zhiqiang LiuXuanhua ShiHai JinPublished in: Big Data Res. (2022)
Keyphrases
- big data
- training examples
- digital data
- vast amounts of data
- data analysis
- huge data
- small number
- training data
- prior domain knowledge
- data processing
- training samples
- supervised learning
- cloud computing
- data sets
- prior knowledge
- end users
- support vector
- massive datasets
- massive data
- stored data
- labeled data
- social media
- training set
- textual data
- information processing
- unlabeled data
- data management
- knowledge management
- data sources
- big data analytics