Login / Signup
Density Estimators of the Cumulative Reward Up to a Hitting Time to a Rarely Visited Set of a Regenerative System.
Marvin K. Nakayama
Bruno Tuffin
Published in:
WSC (2022)
Keyphrases
</>
reinforcement learning
markov chain
density estimation
prior knowledge
gaussian mixture model