Login / Signup

CFR-p: Counterfactual Regret Minimization with Hierarchical Policy Abstraction, and its Application to Two-player Mahjong.

Shiheng Wang
Published in: CoRR (2023)
Keyphrases
  • regret minimization
  • nash equilibrium
  • game theoretic
  • game theory
  • optimal policy
  • broadly applicable
  • high level
  • decision problems
  • multi agent learning
  • worst case
  • hierarchical structure
  • markov decision problems