Login / Signup
Towards Accurate and Model-Free QT Correction.
Esa Räsänen
Ilya Potapov
Janne Solanpää
Katriina Aalto-Setälä
Published in:
CinC (2021)
Keyphrases
</>
model free
reinforcement learning
function approximation
reinforcement learning algorithms
temporal difference
machine learning
policy iteration
policy evaluation
decision trees
dynamic programming
markov decision processes
radial basis function