Login / Signup
Q-learning as a monotone scheme.
Lingyi Yang
Published in:
CoRR (2024)
Keyphrases
</>
reinforcement learning
cooperative
multi agent
function approximation
upper bound
learning algorithm
data mining
data sets
objective function
decision making
multiresolution
machine learning
database
learning scheme
model free
representation scheme
detection scheme
temporal difference learning
recognition scheme