Sample-Efficient Iterative Lower Bound Optimization of Deep Reactive Policies for Planning in Continuous MDPs.

Published in: AAAI (2022)

Keyphrases