Policy gradient adaptive dynamic programming for nonlinear discrete-time zero-sum games with unknown dynamics.
Mingduo Lin
Bo Zhao
Derong Liu
Published in:
Soft Comput. (2023)
Keyphrases
dynamic programming
policy gradient
single input single output
optimal control
actor critic
reinforcement learning
state space
markov chain
function approximation
dead zone
machine learning
linear programming
real valued