Login / Signup
AC-Teach: A Bayesian Actor-Critic Method for Policy Learning with an Ensemble of Suboptimal Teachers.
Andrey Kurenkov
Ajay Mandlekar
Roberto Martin Martin
Silvio Savarese
Animesh Garg
Published in:
CoRR (2019)
Keyphrases
</>
learning algorithm
reinforcement learning
actor critic
learning process
objective function
cost function
active learning
neural network
least squares
supervised learning
mathematical model
learning problems
policy iteration