Login / Signup
Lookahead-Bounded Q-learning.
Ibrahim El Shar
Daniel R. Jiang
Published in:
ICML (2020)
Keyphrases
</>
reinforcement learning
cooperative
function approximation
multi agent
state space
learning algorithm
model free
learning rate
action selection
optimal policy
temporal difference learning
stochastic approximation
multi agent reinforcement learning
information systems
state action