Learning and Exploiting Shaped Reward Models for Large Scale Multiagent RL.
Arambam James Singh, Akshat Kumar, Hoong Chuin Lau
Published in: ICAPS (2021)
Keyphrases
- reinforcement learning
- multi agent
- learning process
- learning algorithm
- learning agents
- learning models
- machine learning
- learning systems
- online learning
- accurate models
- active learning
- probabilistic model
- supervised learning
- prior knowledge
- inverse reinforcement learning
- temporal difference learning
- real world
- temporal difference methods
- learning agent
- function approximation
- autonomous agents
- optimal policy
- model selection
- cooperative