Multi-job Hadoop scheduling to process geo-distributed big data.
Marco Cavallo, Giuseppe Di Modica, Carmelo Polito, Orazio Tomarchio
Published in: ISCC (2017)
Keyphrases
- big data
- data intensive
- cloud computing
- commodity hardware
- data management
- data analysis
- data processing
- vast amounts of data
- social media
- scheduling problem
- unstructured data
- distributed systems
- peer to peer
- big data analytics
- data science
- open source
- data warehousing
- case study
- real world
- database
- massive data
- information systems
- databases