Login / Signup

Solving Multi-Model MDPs by Coordinate Ascent and Dynamic Programming.

Xihong SuMarek Petrik
Published in: CoRR (2024)
Keyphrases
  • dynamic programming
  • markov decision processes
  • objective function
  • coordinate ascent
  • reinforcement learning
  • input data
  • decision theoretic