Distance Minimization for Reward Learning from Scored Trajectories

Benjamin Burchfiel, Carlo Tomasi, Ronald Parr

Published in: AAAI (2016)

Keyphrases

reinforcement learning
learning systems
learning algorithm
learning problems
learning tasks
online learning
mobile learning
supervised learning
unsupervised learning
learning agent
background knowledge
probabilistic model
dynamic programming
active learning
learning environment
image segmentation
knowledge base