Delayed Feedback in Episodic Reinforcement Learning.
Benjamin Howson, Ciara Pike-Burke, Sarah Filippi
Published in: CoRR (2021)
Keyphrases
- reinforcement learning
- delayed feedback
- function approximation
- state space
- multi-agent
- control problems
- machine learning
- Markov decision processes
- reinforcement learning algorithms
- learning algorithm
- model free
- Hopf bifurcation
- temporal difference
- robotic control
- direct policy search
- temporal difference learning
- relational reinforcement learning
- fitted Q iteration
- learning classifier systems
- multi-agent reinforcement learning
- real time
- learning problems
- expert systems
- search algorithm
- databases
- data sets