Efficient Counterfactual Learning from Bandit Feedback.
Yusuke Narita
Shota Yasui
Kohei Yata
Published in: CoRR (2018)
Keyphrases
learning process
learning algorithm
reinforcement learning
learning tasks
computer programming
efficient learning
knowledge base
prior knowledge
active learning
relevance feedback
knowledge acquisition
unsupervised learning
erroneous examples