Sign in

Sample-efficient Iterative Lower Bound Optimization of Deep Reactive Policies for Planning in Continuous MDPs.

Siow Meng LowAkshat KumarScott Sanner
Published in: CoRR (2022)
Keyphrases