Double Deep Q-Learning in Opponent Modeling.

Yangtianze Tao John Doe

Published in: CoRR (2022)

Keyphrases

opponent modeling
reinforcement learning
imperfect information
game playing
cooperative
multi agent
function approximation
state space
learning algorithm
optimal policy
dynamic programming
game tree search
model free
data structure
learning process
learning tools
action selection