Model-free stabilization via Extremum Seeking using a cost neural estimator.
Sara Dubbioso
Azarakhsh Jalalvand
Josiah Wai
Gianmaria De Tommasi
Egemen Kolemen
Published in:
Expert Syst. Appl. (2024)
Keyphrases
model free
reinforcement learning
reinforcement learning algorithms
function approximation
neural network
temporal difference
policy iteration
network architecture
policy evaluation
maximum likelihood
least squares
machine learning
average reward