OPIRL: Sample Efficient Off-Policy Inverse Reinforcement Learning via Distribution Matching.
Hana Hoshino
Kei Ota
Asako Kanezaki
Rio Yokota
Published in:
ICRA (2022)
Keyphrases
inverse reinforcement learning
artificial intelligence
random variables