Login / Signup
Learning Efficient and Effective Exploration Policies with Counterfactual Meta Policy.
Ruihan Yang
Qiwei Ye
Tie-Yan Liu
Published in:
CoRR (2019)
Keyphrases
</>
learning algorithm
optimal policy
prior knowledge
online learning
learning tasks
reinforcement learning
learning systems
highly efficient
learning process
knowledge acquisition
learning problems
effective learning
active exploration