Online Fault Classification in HPC Systems Through Machine Learning.
Alessio Netti, Zeynep Kiziltan, Özalp Babaoglu, Alina Sîrbu, Andrea Bartolini, Andrea Borghesi
Published in: Euro-Par (2019)
Keyphrases
- machine learning
- decision trees
- classification systems
- pattern recognition
- machine learning methods
- machine learning algorithms
- supervised machine learning
- support vector machine
- machine learning approaches
- text classification
- classification accuracy
- preprocessing
- feature selection
- learning algorithm
- feature space
- fault tolerance
- learning systems
- reinforcement learning
- knowledge based systems
- image classification
- expert systems
- feature vectors
- support vector
- supervised learning
- data mining
- automatic classification
- classification method
- distributed systems
- benchmark datasets
- supervised classification
- model selection
- computer systems
- knowledge acquisition
- online learning
- computer vision
- computing systems
- class labels
- classification models
- cost sensitive
- data analysis
- management system
- unsupervised learning
- real time
- computational intelligence