Lookahead-Bounded Q-learning.

Ibrahim El Shar, Daniel R. Jiang

Published in: ICML (2020)

Keyphrases

reinforcement learning
cooperative
function approximation
multi-agent
state space
learning algorithm
model-free
learning rate
action selection
optimal policy
temporal difference learning
stochastic approximation
multi-agent reinforcement learning
information systems
state-action