Model-based Reinforcement Learning with Multi-step Plan Value Estimation.

Haoxin Lin, Yihao Sun, Jiaji Zhang, Yang Yu
Published in: CoRR (2022)
Keyphrases
  • multi step
  • model based reinforcement learning
  • markov decision processes
  • dynamic programming
  • state space
  • knn
  • reinforcement learning
  • semi supervised