Towards Optimal Adversarial Robust Q-learning with Bellman Infinity-error.
Haoran Li
Zicheng Zhang
Wang Luo
Congying Han
Yudong Hu
Tiande Guo
Shichen Liao
Published in:
CoRR (2024)
Keyphrases
estimation error
multi agent
reinforcement learning
piecewise linear
cooperative
dynamic programming
minimum error
worst case
state space
evaluation function
multi agent systems
state action
error tolerance
optimal solution
closed form
threshold values
locally optimal