Opponent Aware Reinforcement Learning.

Víctor Gallego, Roi Naveiro, David Ríos Insua, David Gómez-Ullate

Published in: CoRR (2019)

Keyphrases

reinforcement learning
function approximation
Markov decision processes
state space
model free
robotic control
agent receives
learning algorithm
control problems
direct policy search
current situation
reinforcement learning algorithms
optimal policy
transfer learning
action selection
temporal difference
partially observable
cooperative
decision trees
imperfect information
information systems
machine learning
real time