Reinforcement Learning with Random Delays.
Yann Bouteiller, Simon Ramstedt, Giovanni Beltrame, Christopher J. Pal, Jonathan Binas
Published in: ICLR (2021)
Keyphrases
- reinforcement learning
- state space
- function approximation
- databases
- policy search
- database
- Markov decision processes
- optimal policy
- learning process
- information systems
- machine learning
- optimal solution
- website
- artificial intelligence
- optimal control
- model free
- action selection
- reinforcement learning algorithms
- learning capabilities
- control problems
- temporal difference learning
- stochastic approximation
- transition model