Login / Signup
Scalable Methods for Computing State Similarity in Deterministic Markov Decision Processes.
Pablo Samuel Castro
Published in:
AAAI (2020)
Keyphrases
</>
markov decision processes
state space
optimal policy
reinforcement learning
dynamic programming
multi agent
infinite horizon
reinforcement learning algorithms
finite horizon