Learning Cooperative Multi-Agent Policies With Partial Reward Decoupling.
Benjamin Freed
Aditya Kapoor
Ian Abraham
Jeff G. Schneider
Howie Choset
Published in:
IEEE Robotics Autom. Lett. (2022)
Keyphrases
reinforcement learning
learning process
online learning
neural network
inductive inference
prior knowledge
active learning
learning scenarios
supervised learning
learning systems
bandit problems
learning problems
learning tasks
learning experience
dynamic programming
artificial intelligence
learning algorithm