VA-learning as a more efficient alternative to Q-learning.
Yunhao TangRémi MunosMark RowlandMichal ValkoPublished in: CoRR (2023)
Keyphrases
- reinforcement learning
- learning algorithm
- multi agent
- learning process
- supervised learning
- relational reinforcement learning
- mobile learning
- unsupervised learning
- prior knowledge
- cooperative
- active learning
- neural network
- online learning
- knowledge acquisition
- learning systems
- learning tasks
- learning scheme
- td learning
- machine learning