Benchmarking Apache Spark and Hadoop MapReduce on Big Data Classification.
Taha TekdoganAli CakmakPublished in: ICCBDC (2021)
Keyphrases
- big data
- cloud computing
- data intensive
- data analytics
- data management
- data analysis
- open source
- data intensive computing
- unstructured data
- social media
- massive data
- map reduce
- vast amounts of data
- data processing
- business intelligence
- high volume
- big data analytics
- data science
- data mining
- knowledge discovery
- parallel processing
- predictive modeling
- data warehouse
- commodity hardware
- machine learning
- information processing
- distributed computing
- data warehousing
- model selection
- distributed systems
- mapreduce framework
- management system
- database