Soft-Bellman Equilibrium in Affine Markov Games: Forward Solutions and Inverse Learning.
Shenghui Chen
Yue Yu
David Fridovich-Keil
Ufuk Topcu
Published in:
CoRR (2023)
Keyphrases
learning process
learning algorithm
supervised learning
reinforcement learning
linear program
Nash equilibrium
reinforcement learning algorithms
temporal difference learning