Login / Signup
Bandit Submodular Maximization for Multi-Robot Coordination in Unpredictable and Partially Observable Environments.
Zirui Xu
Xiaofeng Lin
Vasileios Tzoumas
Published in:
CoRR (2023)
Keyphrases
</>
partially observable environments
multi robot coordination
objective function
multi robot
inverse reinforcement learning
partially observable
reinforcement learning algorithms
reinforcement learning
cooperative games
markov chain
path planning
model free
partially observable markov decision processes