Strike the Balance between System Utilization and Data Locality under Deadline Constraint for MapReduce Clusters.
Yeh-Cheng Chen, Jerry Chou
Published in: PDCAT (2017)
Keyphrases
- data sets
- data processing
- data structure
- raw data
- data collection
- input data
- data points
- neural network
- data quality
- data objects
- synthetic data
- image data
- knowledge discovery
- data analysis
- data sources
- high quality
- original data
- data records
- small number
- data streams
- high dimensional data
- experimental data
- training data
- clustering algorithm
- metadata
- big data
- data samples
- dimensional vector