Login / Signup
Approximate Function Evaluation via Multi-Armed Bandits.
Tavor Z. Baharav
Gary Cheng
Mert Pilanci
David Tse
Published in:
CoRR (2022)
Keyphrases
</>
multi armed bandits
bandit problems
special case
dynamic programming
probability distribution
online learning
markov decision processes
piecewise linear