Login / Signup
Policy Improvement via Imitation of Multiple Oracles.
Ching-An Cheng
Andrey Kolobov
Alekh Agarwal
Published in:
NeurIPS (2020)
Keyphrases
</>
case study
reinforcement learning
significant improvement
real world
decision trees