Login / Signup
Continuous-Time Fitted Value Iteration for Robust Policies.
Michael Lutter
Boris Belousov
Shie Mannor
Dieter Fox
Animesh Garg
Jan Peters
Published in:
IEEE Trans. Pattern Anal. Mach. Intell. (2023)
Keyphrases
</>
optimal policy
state space
computationally efficient
markov decision processes
reinforcement learning
real time
heuristic search
parameter tuning
partially observable markov decision processes
markov decision process
data sets
learning algorithm
markov chain
dynamical systems
markov processes