Global Optimality and Finite Sample Analysis of Softmax Off-Policy Actor Critic under State Distribution Mismatch.

Published in: J. Mach. Learn. Res. (2022)

Keyphrases