Repeated Inverse Reinforcement Learning.
Kareem Amin
Nan Jiang
Satinder P. Singh
Published in:
CoRR (2017)
Keyphrases
inverse reinforcement learning
bayesian nonparametric
partially observable environments
preference elicitation
reward function
temporal difference
machine learning
bayesian networks
fuzzy logic
simple examples