Workload characterization on a production Hadoop cluster: A case study on Taobao.
Zujie RenXianghua XuJian WanWeisong ShiMin ZhouPublished in: IISWC (2012)
Keyphrases
- case study
- clustering algorithm
- open source
- cloud computing
- production system
- response time
- production planning
- distributed systems
- data clustering
- database workloads
- distributed computing
- big data
- test bed
- hierarchical clustering
- mapreduce framework
- hierarchical structure
- production cost
- map reduce
- production scheduling
- index selection
- knowledge discovery