Model-free policy iteration approach to NCE-based strategy design for linear quadratic Gaussian games.
Zhenhui XuTielong ShenMinyi HuangPublished in: Autom. (2023)
Keyphrases
- model free
- policy iteration
- reinforcement learning
- markov decision processes
- reinforcement learning algorithms
- linear quadratic
- policy evaluation
- function approximation
- optimal control
- average reward
- temporal difference
- least squares
- optimal policy
- fixed point
- finite state
- infinite horizon
- dynamic programming
- linear programming