Offline-Boosted Actor-Critic: Adaptively Blending Optimal Historical Behaviors in Deep Off-Policy RL.

Published in: CoRR (2024)

Keyphrases