HSVI-based online minimax strategies for partially observable stochastic games with neural perception mechanisms.
Rui Yan
Gabriel Santos
Gethin Norman
David Parker
Marta Kwiatkowska
Published in:
L4DC (2024)
Keyphrases
partially observable stochastic games
dynamic programming
multi-agent
Nash equilibrium
online learning
partially observable Markov decision processes
network architecture
neural network
cooperative
optimal control
worst case
visual processing