Login / Signup
On the impact of tangled program graph marking schemes under the atari reinforcement learning benchmark.
Alexandru Ianta
Ryan Amaral
Caleidgh Bayer
Robert J. Smith
Malcolm I. Heywood
Published in:
GECCO (2021)
Keyphrases
</>
reinforcement learning
linear value function approximation
random walk
state space
function approximation
learning algorithm
optimal policy
machine learning
neural network
action selection
decision making
multi agent
transfer learning
markov models