Keyphrases
- reinforcement learning
- reinforcement learning algorithms
- estimation accuracy
- temporal difference
- estimation error
- function approximation
- learning algorithm
- parameter estimation
- evaluation function
- supervised learning
- estimation algorithm
- maximum likelihood estimation
- model-free
- dynamic programming
- artificial intelligence
- data sets
- autonomous learning
- multi-agent reinforcement learning
- policy search