Login / Signup
Convergence and Near Optimality of Q-Learning with Finite Memory for Partially Observed Models.
Ali Devran Kara
Serdar Yüksel
Published in:
CDC (2021)
Keyphrases
</>
partially observed
multi agent
data sets
cooperative
statistical models
prior knowledge
dynamic environments
operating system
process model
statistical model
experimental data
main memory