PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning.
Angelos FilosClare LyleYarin GalSergey LevineNatasha JaquesGregory FarquharPublished in: CoRR (2021)
Keyphrases
- temporal difference learning
- reinforcement learning
- function approximation
- learning algorithm
- learning process
- fixed point
- learning tasks
- reinforcement learning algorithms
- prior knowledge
- temporal difference
- evaluation function
- game playing
- function approximators
- supervised learning
- multi agent
- state space
- cost function
- action selection
- feature space