Login / Signup

Formally Verified Approximate Policy Iteration.

Maximilian SchäffelerMohammad Abdulaziz
Published in: CoRR (2024)
Keyphrases
  • approximate policy iteration
  • policy iteration
  • reinforcement learning
  • temporal difference
  • markov decision problems