Login / Signup

Future memories are not needed for large classes of POMDPs.

Victor CohenAxel Parmentier
Published in: Oper. Res. Lett. (2023)
Keyphrases
  • long term
  • reinforcement learning
  • dynamic programming
  • belief state
  • data sets
  • search algorithm
  • training set
  • associative memory
  • policy search
  • point based value iteration