A comparative between hadoop mapreduce and apache Spark on HDFS.

Mohamed Saouabi Abdellah Ezzati

Published in: IML (2017)

Keyphrases

mapreduce framework
cloud computing
map reduce
open source
large scale data sets
frequent itemset mining
open source software
web server
data management
efficient implementation
parallel computation
real time
distributed computing
mailing lists
linux kernel
data sets
comparative analysis
distributed processing
open source projects
parallel computing
data warehouse
web services
databases
commodity hardware