Keyphrases
- reinforcement learning
- markov decision processes
- function approximation
- state space
- reward function
- reward shaping
- reinforcement learning algorithms
- model free
- policy search
- transfer learning
- learning algorithm
- optimal policy
- temporal difference learning
- multi agent
- action selection
- temporal difference
- learning problems
- original data
- learning process
- data sets
- partially observable
- hidden state
- learning agent
- policy iteration
- robotic control
- infinite horizon
- learning classifier systems
- optimal control
- supervised learning
- dynamic programming
- active learning
- information systems