Login / Signup
Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model.
Haruka Kiyohara
Yuta Saito
Tatsuya Matsuhiro
Yusuke Narita
Nobuyuki Shimizu
Yasuo Yamamoto
Published in:
CoRR (2022)
Keyphrases
</>
probabilistic model
objective function
machine learning
em algorithm
optimal policy