Imitating, Fast and Slow: Robust learning from demonstrations via decision-time planning.
Carl QiPieter AbbeelAditya GroverPublished in: CoRR (2022)
Keyphrases
- learning systems
- learning algorithm
- learning process
- complex domains
- mobile learning
- supervised learning
- online learning
- active learning
- reinforcement learning
- mobile robot
- decision makers
- neural network
- computationally efficient
- search algorithm
- partial occlusion
- decision theoretic
- goal directed
- action selection
- stochastic domains