Breaking Down Hadoop Distributed File Systems Data Analytics Tools: Apache Hive vs. Apache Pig vs. Pivotal HWAQ.
Xin ChenLiting HuLiangqi LiuJing ChangDiana Leante BonePublished in: CLOUD (2017)
Keyphrases
- map reduce
- data analytics
- open source
- file system
- commodity hardware
- cloud computing
- parallel computation
- open source projects
- efficient implementation
- open source software
- scalable distributed
- recently developed
- source code
- community detection
- big data
- case study
- storage systems
- data analysis
- distributed systems
- parallel computing
- join operations
- data sets
- data model
- database