Representations for Stable Off-Policy Reinforcement Learning.

Dibya Ghosh, Marc G. Bellemare

Published in: ICML (2020)

Keyphrases

reinforcement learning
function approximation
machine learning
learning algorithm
reinforcement learning algorithms
temporal difference
Markov decision processes
optimal policy
robotic control
state space
higher level
temporal difference learning
information systems
database
multi agent
similarity measure
evaluation function
clustering algorithm
learning classifier systems
model free
search engine
artificial intelligence
multi agent reinforcement learning
transition model
databases