Generalized Policy Iteration Adaptive Dynamic Programming for Discrete-Time Nonlinear Systems.
Derong LiuQinglai WeiPengfei YanPublished in: IEEE Trans. Syst. Man Cybern. Syst. (2015)
Keyphrases
- nonlinear systems
- policy iteration
- dynamic programming
- adaptive control
- markov decision processes
- reinforcement learning
- adaptive neural
- optimal policy
- finite state
- optimal control
- infinite horizon
- control law
- dead zone
- markov decision problems
- fuzzy systems
- fuzzy model
- approximate dynamic programming
- linear programming
- state space
- fuzzy control
- average reward
- fixed point
- learning rate
- least squares
- markov chain
- control method
- fuzzy controller
- actor critic
- markov decision process
- temporal difference
- model free
- neural network
- convergence rate
- real time