Static and Dynamic Big Data Partitioning on Apache Spark.
Massimiliano Bertolucci, Emanuele Carlini, Patrizio Dazzi, Alessandro Lulli, Laura Ricci
Published in: PARCO (2015)
Keyphrases
- big data
- open source
- cloud computing
- vast amounts of data
- unstructured data
- high volume
- data management
- knowledge discovery
- data intensive
- data processing
- social media
- open source software
- big data analytics
- data analysis
- map reduce
- data visualization
- massive data
- data science
- data warehousing
- business intelligence
- predictive modeling
- open source projects
- social computing
- massive datasets
- case study
- health informatics
- databases