Deep Residual Reinforcement Learning.
Shangtong Zhang, Wendelin Boehmer, Shimon Whiteson
Published in: AAMAS (2020)
Keyphrases
- reinforcement learning
- function approximation
- multi-agent
- state space
- reinforcement learning algorithms
- temporal difference
- multi-agent reinforcement learning
- temporal difference learning
- control problems
- databases
- learning problems
- transfer learning
- optimal policy
- transition model
- model-free
- policy search
- continuous state
- direct policy search
- function approximators
- robot control
- Markov decision processes
- supervised learning
- learning algorithm
- machine learning