Efficient Semantic-Aware Coflow Scheduling for Data-Parallel Jobs.
Ziyang Li, Yiming Zhang, Yunxiang Zhao, Dongsheng Li
Published in: CLUSTER (2016)
Keyphrases
- data sets
- data analysis
- data sources
- data collection
- training data
- original data
- data distribution
- high quality
- resource allocation
- data processing
- computer systems
- statistical analysis
- parallel processing
- data points
- scheduling problem
- end users
- high dimensional data
- synthetic data
- missing data
- data structure
- data quality