Imitating, Fast and Slow: Robust learning from demonstrations via decision-time planning.

Carl Qi Pieter Abbeel Aditya Grover

Published in: CoRR (2022)

Keyphrases

learning systems
learning algorithm
learning process
complex domains
mobile learning
supervised learning
online learning
active learning
reinforcement learning
mobile robot
decision makers
neural network
computationally efficient
search algorithm
partial occlusion
decision theoretic
goal directed
action selection
stochastic domains