Login / Signup
Optimal and Approximate Q-value Functions for Decentralized POMDPs
Frans A. Oliehoek
Matthijs T. J. Spaan
Nikos A. Vlassis
Published in:
CoRR (2011)
Keyphrases
</>
dynamic programming
reinforcement learning
dec pomdps
cooperative
multi agent
optimal solution
bayesian networks
worst case
markov decision processes
distributed constraint optimization
linear combination of basis
supply chain
peer to peer
dynamic environments
piecewise linear