Deep Bayesian Reward Learning from Preferences.
Daniel S. Brown
Scott Niekum
Published in:
CoRR (2019)
Keyphrases
reinforcement learning
learning algorithm
learning process
supervised learning
online learning
active learning
decision trees
unsupervised learning
learning systems
mobile learning
Markov decision processes
solving problems
learning capabilities
state action