A study of first-passage time minimization via Q-learning in heated gridworlds.
Maria A. Larchenko
Pavel Osinenko
Grigory Yaremenko
Vladimir V. Palyulin
Published in:
CoRR (2021)
Keyphrases
empirical studies
experimental study
state space
stochastic approximation
cooperative
function approximation
database
databases
genetic algorithm
learning algorithm
statistical analysis