Keyphrases
- reinforcement learning
- function approximation
- routing algorithm
- machine learning
- temporal difference learning
- learning classifier systems
- multiprocessor
- temporal difference
- Markov decision processes
- transfer learning
- state space
- learning algorithm
- data sets
- learning process
- reinforcement learning algorithms
- multi-agent
- network-on-chip
- multi-agent reinforcement learning
- reinforcement learning methods
- model-free
- Markov decision process
- action selection
- optimal control
- transition model
- robotic control
- learning problems