Regret Analysis in Deterministic Reinforcement Learning.
Damianos Tranos
Alexandre Proutière
Published in:
CDC (2021)
Keyphrases
reinforcement learning
machine learning
online learning
function approximation
temporal difference
database
neural network
information retrieval
data analysis
active learning
statistical analysis
Markov decision processes
quantitative analysis