Keyphrases
- reinforcement learning
- function approximation
- state space
- temporal difference
- optimal policy
- learning algorithm
- reinforcement learning algorithms
- multi-agent reinforcement learning
- dynamic programming
- temporal difference learning
- control problems
- model-free
- machine learning
- Markov decision processes
- multi-agent
- transfer learning
- supervised learning
- policy search
- robotic control
- partially observable
- direct policy search
- state information
- decision problems
- action selection
- Bayesian networks
- image sequences
- decision making
- information retrieval