Self Adaptive Application Level Fault Tolerance for Parallel and Distributed Computing.
Zizhong Chen, Ming Yang, Guillermo A. Francia III, Jack J. Dongarra
Published in: IPDPS (2007)
Keyphrases
- application level
- fault tolerance
- parallel and distributed computing
- fault tolerant
- parallel programming
- operating system
- distributed computing
- distributed systems
- overlay network
- network management
- response time
- load balancing
- peer to peer
- quality of service
- mobile agents
- virtual machine
- parallel algorithm
- database replication
- massively parallel
- real time
- cloud computing
- computer systems
- multimedia
- parallel processing
- sensor nodes
- grid computing
- computing systems
- parallel computing
- programming language
- web services
- programming environment
- artificial intelligence
- failure recovery
- data mining
- database