Counterfactual equivalence for POMDPs, and underlying deterministic environments.
Stuart Armstrong
Published in: CoRR (2018)
Keyphrases
reinforcement learning
dynamic environments
state space
Markov decision processes
black box
belief state
highly dynamic
database
real time
data sets
robotic systems
autonomous robots
causal reasoning
finite state automaton