Teraflops Supercomputer: Architecture and Validation of the Fault Tolerance Mechanisms.
Cristian ConstantinescuPublished in: IEEE Trans. Computers (2000)
Keyphrases
- fault tolerance
- fault tolerant
- massively parallel
- distributed computing
- load balancing
- high availability
- distributed query processing
- distributed systems
- building blocks
- response time
- fault management
- high performance computing
- database replication
- management system
- peer to peer
- group communication
- replicated databases
- single point of failure
- data replication
- failure recovery
- floating point
- fine grained
- database
- grid computing
- smart card
- replica control