Cyclophobic Reinforcement Learning.

Stefan Sylvius Wagner Peter Arndt Jan Robine Stefan Harmeling

Published in: Trans. Mach. Learn. Res. (2023)

Keyphrases

reinforcement learning
function approximation
state space
learning algorithm
model free
reinforcement learning algorithms
optimal policy
temporal difference learning
databases
machine learning
case study
temporal difference
continuous state
real time
stochastic approximation
supervised learning
semi supervised