Training Agents using Upside-Down Reinforcement Learning.
Rupesh Kumar Srivastava, Pranav Shyam, Filipe Mutz, Wojciech Jaskowski, Jürgen Schmidhuber
Published in: CoRR (2019)
Keyphrases
- reinforcement learning
- multi agent
- learning agents
- multi agent systems
- autonomous agents
- multi agent environments
- multi agent reinforcement learning
- supervised learning
- intelligent agents
- agent receives
- dynamic environments
- multiagent systems
- learning agent
- action selection
- mobile agents
- cooperative
- multiple agents
- single agent
- reinforcement learning agents
- training samples
- partial observability
- agent model
- function approximation
- software agents
- multiagent learning
- training set
- agent behavior
- artificial agents
- temporal difference
- agent systems
- learning capabilities
- model free
- decision theoretic
- training process
- resource allocation
- learned knowledge
- learning process
- optimal policy
- optimal control
- learning tasks
- multiagent reinforcement learning
- robocup soccer
- machine learning