An Approximately Optimal Relative Value Learning Algorithm for Averaged MDPs with Continuous States and Actions.

Hiteshi Sharma, Rahul Jain
Published in: Allerton (2019)
Keyphrases