Optimizing Asynchronous Multi-Level Checkpoint/Restart Configurations with Machine Learning.
Tonmoy Dey, Kento Sato, Bogdan Nicolae, Jian Guo, Jens Domke, Weikuan Yu, Franck Cappello, Kathryn Mohror
Published in: IPDPS Workshops (2020)
Keyphrases
- machine learning
- random walk
- learning algorithm
- machine learning algorithms
- computer vision
- decision trees
- data mining
- feature selection
- fault tolerance
- pattern recognition
- learning systems
- information extraction
- inductive learning
- fault tolerant
- learning problems
- machine learning methods
- transfer learning
- machine learning approaches
- asynchronous circuits
- natural language processing
- data analysis
- computational intelligence
- reinforcement learning
- computer science
- neural network
- supervised machine learning
- explanation based learning
- artificial intelligence
- multi layer
- information systems
- support vector
- training set
- mobile agents
- support vector machine
- supervised learning
- knowledge acquisition