Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences.

Published in: CoRR (2021)

Keyphrases