Keyphrases
- reinforcement learning
- function approximation
- optimal policy
- reinforcement learning algorithms
- state space
- model free
- control problems
- machine learning
- temporal difference learning
- temporal difference
- databases
- transition model
- reinforcement learning methods
- markov decision process
- action selection
- sufficient conditions
- dynamic programming
- multi agent
- optimal control
- markov decision processes
- supervised learning
- learning process
- robot control
- control policy
- image sequences
- artificial intelligence
- database
- robotic control