Login / Signup
Achievability of asymptotic minimax regret by horizon-dependent and horizon-independent strategies.
Kazuho Watanabe
Teemu Roos
Published in:
J. Mach. Learn. Res. (2015)
Keyphrases
</>
minimax regret
worst case
data sets
decision making
reinforcement learning
dynamic programming