Output feedback Q-learning for discrete-time finite-horizon zero-sum games with application to the H∞ control.
Mingxiang Liu
Qianqian Cai
Dandan Li
Wei Meng
Minyue Fu
Published in:
Neurocomputing (2023)
Keyphrases
finite horizon
optimal policy
reinforcement learning
state space
control system
markov chain
markov decision processes
finite state
optimal stopping
steady state
infinite horizon
multistage
optimal control
control policies