Sign in

Reinforcement Learning to Rank with Pairwise Policy Gradient.

Jun XuZeng WeiLong XiaYanyan LanDawei YinXueqi ChengJi-Rong Wen
Published in: SIGIR (2020)
Keyphrases