Keyphrases
- reinforcement learning
- function approximation
- Markov decision processes
- state space
- reinforcement learning algorithms
- learning algorithm
- real world
- temporal difference
- mobile robot
- optimal policy
- multi-agent
- model-free
- deep learning
- dynamic programming
- support vector
- image sequences
- action selection
- knowledge base
- database
- learning agents
- temporal difference learning