A Data Distribution Aware Task Scheduling Strategy for MapReduce System.
Leitao Guo, Hongwei Sun, Zhiguo Luo
Published in: CloudCom (2009)
Keyphrases
- data distribution
- scheduling strategy
- scheduling algorithm
- grid computing
- distributed computing
- round robin
- data streams
- index structure
- data points
- cloud computing
- streaming data
- high dimensional data
- data skew
- concept drift
- quality of service
- computing environments
- peer to peer
- response time
- data mining
- distributed environment
- neural network
- query processing
- feature vectors
- continuous queries
- feature extraction
- metadata