Understanding the Effects of Communication and Coordination on Checkpointing at Scale.
Kurt B. FerreiraPatrick M. WidenerScott LevyDorian C. ArnoldTorsten HoeflerPublished in: SC (2014)
Keyphrases
- mechanisms underlying
- distributed control
- information sharing
- distributed databases
- communication systems
- information exchange
- agent communication
- multi agent
- information flows
- small scale
- failure recovery
- main memory databases
- multi agent reinforcement learning
- low overhead
- fault tolerance
- multi agent systems
- cooperative
- scale invariant
- communication overhead
- distributed database systems
- communication cost
- communication networks
- load balancing
- scale space