Teachable Reinforcement Learning via Advice Distillation.
Olivia Watkins, Abhishek Gupta, Trevor Darrell, Pieter Abbeel, Jacob Andreas
Published in: NeurIPS (2021)
Keyphrases
- reinforcement learning
- state space
- function approximation
- multi agent
- learning algorithm
- reinforcement learning algorithms
- optimal policy
- multi agent reinforcement learning
- action space
- transfer learning
- real world
- markov decision processes
- real time
- supervised learning
- markov decision process
- learning environment
- stochastic approximation