Login / Signup
Off-Policy Adversarial Inverse Reinforcement Learning.
Samin Yeasar Arnob
Published in:
CoRR (2020)
Keyphrases
</>
inverse reinforcement learning
bayesian nonparametric
partially observable environments
preference elicitation
reward function
multi agent
temporal difference
data mining
artificial intelligence
special case
decision making
step size