An Efficient, Generalized Bellman Update For Cooperative Inverse Reinforcement Learning.
Dhruv Malik
Malayandi Palaniappan
Jaime F. Fisac
Dylan Hadfield-Menell
Stuart J. Russell
Anca D. Dragan
Published in:
ICML (2018)
Keyphrases
cooperative
inverse reinforcement learning
bayesian nonparametric
partially observable environments
preference elicitation
reward function
artificial intelligence
control system
special case
probabilistic model
probability distribution
linear program