Login / Signup

What Should Be Observed for Optimal Reward in POMDPs?

Alyzia-Maria KonstaAlberto Lluch LafuenteChristoph Matheja
Published in: CAV (3) (2024)
Keyphrases
  • reinforcement learning
  • dynamic programming
  • average reward
  • optimal solution
  • genetic algorithm
  • expected reward
  • worst case
  • markov decision processes
  • closed form
  • optimal strategy