From External to Swap Regret 2.0: An Efficient Reduction for Large Action Spaces.
Yuval Dagan
Constantinos Daskalakis
Maxwell Fishelson
Noah Golowich
Published in:
STOC (2024)
Keyphrases
action space
state space
Markov decision processes
real valued
state and action spaces
online learning
graph cuts
stochastic processes
continuous state
reinforcement learning
lower bound
skill learning
continuous action
data mining
search space