Login / Signup

UNEX-RL: Reinforcing Long-Term Rewards in Multi-Stage Recommender Systems with UNidirectional EXecution.

Gengrui ZhangYao WangXiaoshuang ChenHongyi QianKaiqiao ZhanBen Wang
Published in: AAAI (2024)
Keyphrases