Opponent Aware Reinforcement Learning.
Víctor GallegoRoi NaveiroDavid Ríos InsuaDavid Gómez-UllatePublished in: CoRR (2019)
Keyphrases
- reinforcement learning
- function approximation
- markov decision processes
- state space
- model free
- robotic control
- agent receives
- learning algorithm
- control problems
- direct policy search
- current situation
- reinforcement learning algorithms
- optimal policy
- transfer learning
- action selection
- temporal difference
- partially observable
- cooperative
- decision trees
- imperfect information
- information systems
- machine learning
- real time