Filter based Explorized Policy Iteration Algorithm for On-Policy Approximate LQR.
Sumit Kumar JhaSayan Basu RoyShubhendu BhasinPublished in: SSCI (2019)
Keyphrases
- policy iteration algorithm
- finite state
- markov decision processes
- reinforcement learning
- policy iteration
- optimal control
- policy evaluation
- infinite horizon
- actor critic
- optimal policy
- least squares
- markov chain
- markov decision process
- dynamic programming
- partially observable markov decision processes
- state space
- average cost
- temporal difference
- model checking
- monte carlo
- long run
- exact solution
- decision processes