A comparative between hadoop mapreduce and apache Spark on HDFS.
Mohamed SaouabiAbdellah EzzatiPublished in: IML (2017)
Keyphrases
- mapreduce framework
- cloud computing
- map reduce
- open source
- large scale data sets
- frequent itemset mining
- open source software
- web server
- data management
- efficient implementation
- parallel computation
- real time
- distributed computing
- mailing lists
- linux kernel
- data sets
- comparative analysis
- distributed processing
- open source projects
- parallel computing
- data warehouse
- web services
- databases
- commodity hardware