Login / Signup
Finite-Time Analysis for Double Q-learning.
Huaqing Xiong
Lin Zhao
Yingbin Liang
Wei Zhang
Published in:
NeurIPS (2020)
Keyphrases
</>
reinforcement learning
cooperative
finite number
databases
information systems
image processing
database
case study
multi agent
optimal policy
monte carlo