Distributed Training on a Highly Heterogeneous HPC System.
José Flich, Carles Hernández, Eduardo Quiñones, Roberto Paredes
Published in: SAMOS (2020)
Keyphrases
- highly heterogeneous
- distributed systems
- training set
- cooperative
- lightweight
- high performance computing
- multi agent
- distributed environment
- training process
- mobile agents
- information retrieval
- training data
- distributed computing
- fault tolerance
- communication overhead
- test set
- distributed data
- training algorithm
- fault tolerant
- computer networks
- data sets
- training examples
- online learning
- information systems
- genetic algorithm
- databases