Login / Signup
Reconciling Rewards with Predictive State Representations.
Andrea Baisero
Christopher Amato
Published in:
CoRR (2021)
Keyphrases
</>
predictive state representations
dynamical systems
reinforcement learning
temporal difference
stochastic systems
markov decision processes
past observations
partially observable markov decision processes
state space
finite state