An Improved Task Scheduling Algorithm Based on Cache Locality and Data Locality in Hadoop.
Peng Zhang, Chunlin Li, Yahui Zhao
Published in: PDCAT (2016)
Keyphrases
- data sets
- database
- big data
- spatial locality
- image data
- small number
- experimental data
- data collection
- statistical analysis
- data mining
- data quality
- data objects
- data distribution
- data analysis
- data mining techniques
- cloud computing
- computer systems
- synthetic data
- data processing
- raw data
- distributed systems
- data points
- training data
- distributed computing
- computing power
- social media