Multiagent Reinforcement Learning: Rollout and Policy Iteration for POMDP With Application to Multirobot Problems.
Sushmita Bhattacharya, Siva Kailas, Sahil Badyal, Stephanie Gil, Dimitri P. Bertsekas
Published in: IEEE Trans. Robotics (2024)
Keyphrases
- multi robot
- markov decision processes
- policy iteration
- markov decision process
- path planning
- reinforcement learning
- multiagent reinforcement learning
- optimal policy
- markov games
- mobile robot
- vision system
- finite state
- function approximation
- average reward
- partially observable markov decision processes
- model free
- infinite horizon
- mathematical programming
- machine learning
- model checking
- state space
- np hard
- learning algorithm