POTEC: Off-Policy Learning for Large Action Spaces via Two-Stage Policy Decomposition.

Yuta Saito, Jihan Yao, Thorsten Joachims
Published in: CoRR (2024)
Keyphrases