Learning and Exploiting Shaped Reward Models for Large Scale Multiagent RL.

Arambam James Singh Akshat Kumar Hoong Chuin Lau

Published in: ICAPS (2021)

Keyphrases

reinforcement learning
multi agent
learning process
learning algorithm
learning agents
learning models
machine learning
learning systems
online learning
accurate models
active learning
probabilistic model
supervised learning
prior knowledge
inverse reinforcement learning
temporal difference learning
real world
temporal difference methods
learning agent
function approximation
autonomous agents
optimal policy
model selection
cooperative