Big Data Processing Workflows Oriented Real-Time Scheduling Algorithm using Task-Duplication in Geo-Distributed Clouds.
Huangke ChenJinming WenWitold PedryczGuohua WuPublished in: IEEE Trans. Big Data (2020)
Keyphrases
- big data
- scheduling algorithm
- real time
- data processing
- cloud computing
- high volume
- load balance
- data intensive
- computing resources
- response time
- data intensive computing
- data management
- commodity hardware
- distributed computing
- computational grids
- scheduling strategy
- big data analytics
- grid environment
- vast amounts of data
- distributed systems
- data analysis
- distributed environment
- unstructured data
- knowledge discovery
- data science
- geographically distributed
- peer to peer
- information processing
- business intelligence
- map reduce
- massive datasets
- database
- grid computing
- machine learning
- data warehousing
- load balancing
- social media