Login / Signup
Distance Minimization for Reward Learning from Scored Trajectories.
Benjamin Burchfiel
Carlo Tomasi
Ronald Parr
Published in:
AAAI (2016)
Keyphrases
</>
reinforcement learning
learning systems
learning algorithm
learning problems
learning tasks
online learning
mobile learning
supervised learning
unsupervised learning
learning agent
background knowledge
probabilistic model
dynamic programming
active learning
learning environment
image segmentation
knowledge base