Reachability in continuous-time Markov reward decision processes.
Christel Baier, Boudewijn R. Haverkort, Holger Hermanns, Joost-Pieter Katoen
Published in: Logic and Automata (2008)
Keyphrases
- decision processes
- Markov chain
- state space
- Markov decision processes
- reinforcement learning
- Markov processes
- average reward
- reward function
- stationary policies
- decision problems
- optimal policy
- decision process
- finite state
- Markov process
- decision making
- heuristic search
- dynamical systems
- reasoning process
- partially observable
- transition probabilities
- optimal control
- conditional independence
- state variables
- dynamic programming
- multi agent
- long run
- planning problems
- decision support system
- stochastic processes
- partially observable Markov decision processes
- decision makers
- prior knowledge
- knowledge base