Optimizing Data Locality by Executor Allocation in Reduce Stage for Spark Framework.
Zhongming FuMengsi HeZhuo TangYang ZhangPublished in: PDCAT (2021)
Keyphrases
- data sets
- data collection
- data analysis
- data processing
- synthetic data
- complex data
- data quality
- end users
- database
- statistical analysis
- spatial data
- data structure
- probability distribution
- original data
- data acquisition
- raw data
- computer systems
- noisy data
- multimedia data
- information retrieval
- data sources
- high quality
- training data