Instance-Dependent Complexity of Contextual Bandits and Reinforcement Learning: A Disagreement-Based Perspective.
Dylan J. Foster
Alexander Rakhlin
David Simchi-Levi
Yunzong Xu
Published in:
COLT (2021)
Keyphrases
reinforcement learning
learning algorithm
contextual information
space complexity
viewpoint
function approximation
state space
context sensitive
worst case
machine learning
information retrieval
optimal solution
active learning
computational cost
action selection