Counter approach for the estimation of optimal sequences in Partially Observable Untimed Petri Nets.
Philippe DeclerckPublished in: Discret. Event Dyn. Syst. (2021)
Keyphrases
- petri net
- partially observable
- petri net model
- discrete event systems
- state space
- markov decision processes
- reinforcement learning
- dynamical systems
- decision problems
- colored petri nets
- stochastic petri net
- service composition
- partial observability
- dynamic programming
- partial observations
- optimal solution
- ims ld
- action models
- optimal control
- initially unknown
- partially observable environments
- hidden markov models
- reward function
- markov decision problems
- infinite horizon
- parameter estimation
- sufficient conditions