Login / Signup
An Efficient, Generalized Bellman Update For Cooperative Inverse Reinforcement Learning.
Dhruv Malik
Malayandi Palaniappan
Jaime F. Fisac
Dylan Hadfield-Menell
Stuart J. Russell
Anca D. Dragan
Published in:
CoRR (2018)
Keyphrases
</>
cooperative
inverse reinforcement learning
bayesian nonparametric
partially observable environments
preference elicitation
linear program
dynamic programming