Login / Signup

Deterministic policies based on maximum regrets in MDPs with imprecise rewards.

Pegah AlizadehEmiliano TraversiAomar Osmani
Published in: AI Commun. (2021)
Keyphrases