Login / Signup
Globally Convergent Policy Search for Output Estimation.
Jack Umenberger
Max Simchowitz
Juan C. Perdomo
Kaiqing Zhang
Russ Tedrake
Published in:
NeurIPS (2022)
Keyphrases
</>
policy search
globally convergent
variational inequalities
autocalibration
reinforcement learning
continuous state
global convergence
newton method
dynamic programming
reinforcement learning algorithms
neural network
machine learning
maximum likelihood estimation