Locally weighted least squares policy iteration for model-free learning in uncertain environments.
Matthew HowardYoshihiko NakamuraPublished in: IROS (2013)
Keyphrases
- model free
- reinforcement learning
- locally weighted
- reinforcement learning methods
- learning process
- uncertain environments
- reinforcement learning algorithms
- learning algorithm
- temporal difference
- linear regression
- machine learning
- markov decision processes
- naive bayes
- policy iteration
- stochastic games
- state space
- training data