The Efficacy of Pessimism in Asynchronous Q-Learning.

Yuling Yan, Gen Li, Yuxin Chen, Jianqing Fan

Published in: IEEE Trans. Inf. Theory (2023)

Keyphrases

reinforcement learning
cooperative
multi-agent
state space
function approximation
learning algorithm
reinforcement learning algorithms
learning rate
multi-agent reinforcement learning
optimal policy
action selection
data sets
potential field
stochastic approximation
model-free
discussion forums
information systems
real time
asynchronous communication
delay insensitive