Lucid dreaming for experience replay: refreshing past states with the current policy.

Yunshu Du, Garrett Warnell, Assefaw H. Gebremedhin, Peter Stone, Matthew E. Taylor
Published in: Neural Comput. Appl. (2022)
Keyphrases
  • information systems
  • asymptotically optimal
  • databases
  • probability distribution
  • user experience
  • decision trees
  • image sequences
  • hidden Markov models
  • optimal policy