Rethinking Adversarial Inverse Reinforcement Learning: From the Angles of Policy Imitation and Transferable Reward Recovery.
Yangchun Zhang
Yirui Zhou
Published in:
CoRR (2024)
Keyphrases
inverse reinforcement learning
partially observable environments
reward function
Bayesian nonparametric
preference elicitation
reinforcement learning
temporal difference
multi-agent
Markov decision processes
optimal policy
machine learning
partially observable
decision making
transition probabilities