Login / Signup
Learning nonlinear feedback controllers from data via optimal policy search and stochastic gradient descent.
Laura Ferrarotti
Alberto Bemporad
Published in:
CDC (2020)
Keyphrases
</>
prior knowledge
stochastic gradient descent
dynamic programming
learning algorithm
reinforcement learning
data points
policy search
computational complexity
online learning
loss function
learning tasks
input space
matrix factorization