Keyphrases
- reinforcement learning
- function approximation
- machine learning
- markov decision processes
- optimal policy
- reinforcement learning algorithms
- state space
- partially observed
- deep learning
- model free
- transfer learning
- mobile robot
- dynamic programming
- learning process
- learning environment
- action selection
- temporal difference
- image sequences
- control problems
- markov decision process
- social networks
- temporal difference learning
- databases