CFR-p: Counterfactual Regret Minimization with Hierarchical Policy Abstraction, and its Application to Two-player Mahjong.
Shiheng Wang
Published in:
CoRR (2023)
Keyphrases
regret minimization
nash equilibrium
game theoretic
game theory
optimal policy
broadly applicable
high level
decision problems
multi agent learning
worst case
hierarchical structure
markov decision problems