Delayed Feedback in Episodic Reinforcement Learning.

Benjamin Howson Ciara Pike-Burke Sarah Filippi

Published in: CoRR (2021)

Keyphrases

reinforcement learning
delayed feedback
function approximation
state space
multi agent
control problems
machine learning
markov decision processes
reinforcement learning algorithms
learning algorithm
model free
hopf bifurcation
temporal difference
robotic control
direct policy search
temporal difference learning
relational reinforcement learning
fitted q iteration
learning classifier systems
multi agent reinforcement learning
real time
learning problems
expert systems
search algorithm
databases
data sets