Reinforcement Learning with Random Delays

Yann Bouteiller, Simon Ramstedt, Giovanni Beltrame, Christopher J. Pal, Jonathan Binas

Published in: ICLR (2021)

Keyphrases

reinforcement learning
state space
function approximation
databases
policy search
database
Markov decision processes
optimal policy
learning process
information systems
machine learning
optimal solution
website
artificial intelligence
optimal control
model-free
action selection
reinforcement learning algorithms
learning capabilities
control problems
temporal difference learning
stochastic approximation
transition model