Login / Signup
A Dynamic Regret Analysis and Adaptive Regularization Algorithm for On-Policy Robot Imitation Learning.
Jonathan N. Lee
Michael Laskey
Ajay Kumar Tanwani
Anil Aswani
Ken Goldberg
Published in:
WAFR (2018)
Keyphrases
</>
learning algorithm
dynamic programming
bayesian networks
parameter selection
imitation learning
adaptive regularization
cost function
expectation maximization
computer vision
prior knowledge
mobile robot
probabilistic model
blind image deconvolution