Optimizing the fault-tolerance overheads of HPC systems using prediction and multiple proactive actions.

Published in: J. Supercomput. (2015)

Keyphrases