Double Deep Q-Learning in Opponent Modeling.
Yangtianze Tao
John Doe
Published in:
CoRR (2022)
Keyphrases
opponent modeling
reinforcement learning
imperfect information
game playing
cooperative
multi agent
function approximation
state space
learning algorithm
optimal policy
dynamic programming
game tree search
model free
data structure
learning process
learning tools
action selection