Strategies for Fault-Tolerant Tightly-Coupled HPC Workloads Running on Low-Budget Spot Cloud Infrastructures.
Vanderlei MunhozMárcio CastroOdorico MendizabalPublished in: SBAC-PAD (2022)
Keyphrases
- fault tolerant
- tightly coupled
- fault tolerance
- computing infrastructure
- loosely coupled
- distributed systems
- fine grained
- high performance computing
- general purpose
- high availability
- cloud computing
- load balancing
- distributed computing
- cloud platform
- state machine
- safety critical
- mobile agent system
- data management
- database systems
- high level
- database
- high assurance
- interconnection networks
- computing resources
- case study