Representations for Stable Off-Policy Reinforcement Learning.
Dibya Ghosh, Marc G. Bellemare
Published in: ICML (2020)
Keyphrases
- reinforcement learning
- function approximation
- machine learning
- learning algorithm
- reinforcement learning algorithms
- temporal difference
- markov decision processes
- optimal policy
- robotic control
- state space
- higher level
- temporal difference learning
- information systems
- database
- multi agent
- similarity measure
- evaluation function
- clustering algorithm
- learning classifier systems
- model free
- search engine
- artificial intelligence
- multi agent reinforcement learning
- transition model
- databases