Sign in

Pessimistic Reward Models for Off-Policy Learning in Recommendation.

Olivier JeunenBart Goethals
Published in: RecSys (2021)
Keyphrases