Keyphrases
- reinforcement learning
- markov decision processes
- model free
- function approximation
- multiple objectives
- reinforcement learning algorithms
- sufficient conditions
- artificial neural networks
- multi agent
- robotic control
- learning process
- state space
- transfer learning
- decision making
- temporal difference
- learning capabilities
- policy gradient
- priority queue