Confident Approximate Policy Iteration for Efficient Local Planning in $q^\pi$-realizable MDPs.

Published in: NeurIPS (2022)

Keyphrases