Login / Signup
SMILe: Scalable Meta Inverse Reinforcement Learning through Context-Conditional Policies.
Seyed Kamyar Seyed Ghasemipour
Shixiang Gu
Richard S. Zemel
Published in:
NeurIPS (2019)
Keyphrases
</>
inverse reinforcement learning
reward function
bayesian nonparametric
mixture model
partially observable environments
reinforcement learning
optimal policy
preference elicitation