Characterizing and Synthesizing Task Dependencies of Data-Parallel Jobs in Alibaba Cloud.
Huangshi TianYunchuan ZhengWei WangPublished in: SoCC (2019)
Keyphrases
- missing data
- parallel processing
- data points
- data distribution
- data mining techniques
- data quality
- prior knowledge
- complex data
- synthetic data
- high quality
- data sets
- original data
- data collection
- database
- data structure
- statistical analysis
- end users
- data analysis
- raw data
- small number
- training data
- statistical methods
- experimental data
- image data
- knowledge discovery
- data processing