Checkpoint/Restart and Beyond: Resilient High Performance Computing with FPGAs.
Andrew G. SchmidtBin HuangRon SassMatthew FrenchPublished in: FCCM (2011)
Keyphrases
- high performance computing
- fault tolerance
- hardware software
- fault tolerant
- scientific computing
- field programmable gate array
- parallel computing
- massively parallel
- computing systems
- load balancing
- distributed systems
- distributed computing
- national laboratory
- computational science
- molecular dynamics
- response time
- computing resources
- grid computing
- energy efficiency
- heterogeneous computing
- message passing interface
- mobile agents
- peer to peer
- parallel architectures
- low cost