Learning from eXtreme Bandit Feedback.
Romain Lopez
Inderjit S. Dhillon
Michael I. Jordan
Published in:
AAAI (2021)
Keyphrases
learning systems
active learning
knowledge acquisition
learning algorithm
learning process
bayesian networks
online learning
incremental learning
reinforcement learning
multi agent
expert systems
supervised learning
unsupervised learning
background knowledge
assessment tool