Login / Signup
A Unified Bellman Equation for Causal Information and Value in Markov Decision Processes.
Stas Tiomkin
Naftali Tishby
Published in:
CoRR (2017)
Keyphrases
</>
markov decision processes
dynamic programming
state space
reinforcement learning
optimal policy
finite state
data mining
markov decision process
finite horizon
planning under uncertainty