C
search
search
reviewers
reviewers
feeds
feeds
assignments
assignments
settings
logout
Deep Bayesian Reward Learning from Preferences.
Daniel S. Brown
Scott Niekum
Published in:
CoRR (2019)
Keyphrases
</>
reinforcement learning
learning algorithm
learning process
supervised learning
online learning
active learning
decision trees
unsupervised learning
learning systems
mobile learning
markov decision processes
solving problems
learning capabilities
state action