Delayed Reinforcement Learning by Imitation.
Pierre LiotetDavide MaranLorenzo BisiMarcello RestelliPublished in: ICML (2022)
Keyphrases
- reinforcement learning
- function approximation
- state space
- model free
- reinforcement learning algorithms
- multi agent
- learning process
- temporal difference
- imitation learning
- learning algorithm
- multi agent reinforcement learning
- optimal policy
- markov decision processes
- control problems
- optimal control
- dynamic programming
- action space
- reinforcement learning methods
- learning problems
- partially observable
- neural network
- supervised learning
- website
- genetic algorithm