Login / Signup
Model-free adaptive optimal control policy for Markov jump systems: A value iterations algorithm.
Peixin Zhou
Jiwei Wen
Akshya Kumar Swain
Xiaoli Luan
Published in:
J. Syst. Control. Eng. (2022)
Keyphrases
</>
model free
control policy
optimal solution
reinforcement learning
dynamic programming
computational complexity
learning algorithm
approximate dynamic programming
control policies
markov chain
least squares
objective function
monte carlo
np hard
policy iteration
search space
average reward
data mining