CatchAndRetry: extending exceptions to handle distributed system failures and recovery.
Emre KicimanBenjamin LivshitsMadanlal MusuvathiPublished in: PLOS@SOSP (2009)
Keyphrases
- distributed systems
- failure recovery
- fault tolerance
- load balancing
- fault tolerant
- failure detection
- distributed environment
- message passing
- geographically distributed
- mobile agents
- concurrent systems
- distributed database systems
- distributed computing
- operating system
- software architecture
- end to end
- information management
- mobile computing
- data replication