Seflow: Efficient Flow Scheduling for Data-Parallel Jobs.
Qiao ZhouZiyang LiPing ZhongTian TianYuxing PengPublished in: ICDCS Workshops (2017)
Keyphrases
- data sets
- image data
- data analysis
- data collection
- data points
- parallel machines
- data processing
- data structure
- data quality
- data mining techniques
- high quality
- raw data
- database
- training data
- synthetic data
- precedence constraints
- job scheduling
- high dimensional data
- scheduling problem
- probability distribution
- end users
- high dimensional