The Efficacy of Pessimism in Asynchronous Q-Learning.
Yuling Yan, Gen Li, Yuxin Chen, Jianqing Fan
Published in: IEEE Trans. Inf. Theory (2023)
Keyphrases
- reinforcement learning
- cooperative
- multi agent
- state space
- function approximation
- learning algorithm
- reinforcement learning algorithms
- learning rate
- multi agent reinforcement learning
- optimal policy
- action selection
- data sets
- potential field
- stochastic approximation
- model free
- discussion forums
- information systems
- real time
- asynchronous communication
- delay insensitive