Generalized Hindsight for Reinforcement Learning.
Alexander C. LiLerrel PintoPieter AbbeelPublished in: NeurIPS (2020)
Keyphrases
- reinforcement learning
- function approximation
- action selection
- information systems
- learning algorithm
- real time
- artificial intelligence
- machine learning
- markov decision process
- learning problems
- database
- multi agent reinforcement learning
- reinforcement learning methods
- policy search
- learning agents
- reinforcement learning algorithms
- direct policy search
- temporal difference
- model free
- state space
- expert systems
- decision trees
- decision making
- databases
- data sets