Does Zero-Shot Reinforcement Learning Exist?
Ahmed TouatiJérémy RapinYann OllivierPublished in: CoRR (2022)
Keyphrases
- reinforcement learning
- function approximation
- multi agent
- decision making
- control problems
- state space
- model free
- optimal control
- optimal policy
- direct policy search
- machine learning
- robotic control
- temporal difference
- markov decision processes
- evolutionary algorithm
- learning process
- bayesian networks
- learning algorithm