Login / Signup
Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees.
Yifei Zhou
Ayush Sekhari
Yuda Song
Wen Sun
Published in:
ICLR (2024)
Keyphrases
</>
neural network
probability distribution