Reconciling Rewards with Predictive State Representations.

Andrea Baisero, Christopher Amato

Published in: CoRR (2021)

Keyphrases

predictive state representations
dynamical systems
reinforcement learning
temporal difference
stochastic systems
Markov decision processes
past observations
partially observable Markov decision processes
state space
finite state