Login / Signup
AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning.
Enmin Zhao
Renye Yan
Jinqiu Li
Kai Li
Junliang Xing
Published in:
AAAI (2022)
Keyphrases
</>
end to end
artificial intelligence
reinforcement learning
ad hoc networks
wireless ad hoc networks
high bandwidth
congestion control
admission control
monte carlo
multi hop
multipath
markov decision processes
content delivery
application layer
optimal policy
internet protocol
transport layer