​
Login / Signup
Wanqi Xue
ORCID
Publication Activity (10 Years)
Years Active: 2019-2023
Publications (10 Years): 21
Top Topics
Actor Critic
User Engagement
Reinforcement Learning
Recommender Systems
Top Venues
CoRR
AAAI
KDD
IJCAI
</>
Publications
</>
Wanqi Xue
,
Qingpeng Cai
,
Ruohan Zhan
,
Dong Zheng
,
Peng Jiang
,
Kun Gai
,
Bo An
ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor.
ICLR
(2023)
Qingpeng Cai
,
Zhenghai Xue
,
Chi Zhang
,
Wanqi Xue
,
Shuchang Liu
,
Ruohan Zhan
,
Xueliang Wang
,
Tianyou Zuo
,
Wentao Xie
,
Dong Zheng
,
Peng Jiang
,
Kun Gai
Two-Stage Constrained Actor-Critic for Short Video Recommendation.
CoRR
(2023)
Qingpeng Cai
,
Zhenghai Xue
,
Chi Zhang
,
Wanqi Xue
,
Shuchang Liu
,
Ruohan Zhan
,
Xueliang Wang
,
Tianyou Zuo
,
Wentao Xie
,
Dong Zheng
,
Peng Jiang
,
Kun Gai
Two-Stage Constrained Actor-Critic for Short Video Recommendation.
WWW
(2023)
Wanqi Xue
,
Qingpeng Cai
,
Zhenghai Xue
,
Shuo Sun
,
Shuchang Liu
,
Dong Zheng
,
Peng Jiang
,
Kun Gai
,
Bo An
PrefRec: Recommender Systems with Human Preferences for Reinforcing Long-term User Engagement.
KDD
(2023)
Wanqi Xue
,
Bo An
,
Shuicheng Yan
,
Zhongwen Xu
Reinforcement Learning from Diverse Human Preferences.
CoRR
(2023)
Shuxin Li
,
Xinrun Wang
,
Youzhi Zhang
,
Wanqi Xue
,
Jakub CernĂ½
,
Bo An
Solving Large-Scale Pursuit-Evasion Games Using Pre-trained Strategies.
AAAI
(2023)
Shuo Sun
,
Xinrun Wang
,
Wanqi Xue
,
Xiaoxuan Lou
,
Bo An
Mastering Stock Markets with Efficient Mixture of Diversified Trading Experts.
KDD
(2023)
Wanqi Xue
,
Bo An
,
Chai Kiat Yeo
NSGZero: Efficiently Learning Non-exploitable Policy in Large-Scale Network Security Games with Neural Monte Carlo Tree Search.
AAAI
(2022)
Shuo Sun
,
Wanqi Xue
,
Rundong Wang
,
Xu He
,
Junlei Zhu
,
Jian Li
,
Bo An
DeepScalper: A Risk-Aware Reinforcement Learning Framework to Capture Fleeting Intraday Trading Opportunities.
CIKM
(2022)
Wanqi Xue
,
Qingpeng Cai
,
Ruohan Zhan
,
Dong Zheng
,
Peng Jiang
,
Bo An
ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor.
CoRR
(2022)
Wanqi Xue
,
Wei Qiu
,
Bo An
,
Zinovi Rabinovich
,
Svetlana Obraztsova
,
Chai Kiat Yeo
Mis-spoke or mis-lead: Achieving Robustness in Multi-Agent Communicative Reinforcement Learning.
AAMAS
(2022)
Wanqi Xue
,
Qingpeng Cai
,
Zhenghai Xue
,
Shuo Sun
,
Shuchang Liu
,
Dong Zheng
,
Peng Jiang
,
Bo An
PrefRec: Preference-based Recommender Systems for Reinforcing Long-term User Engagement.
CoRR
(2022)
Wanqi Xue
,
Bo An
,
Chai Kiat Yeo
NSGZero: Efficiently Learning Non-Exploitable Policy in Large-Scale Network Security Games with Neural Monte Carlo Tree Search.
CoRR
(2022)
Wanqi Xue
,
Youzhi Zhang
,
Shuxin Li
,
Xinrun Wang
,
Bo An
,
Chai Kiat Yeo
Solving Large-Scale Extensive-Form Network Security Games via Neural Fictitious Self-Play.
IJCAI
(2021)
Wanqi Xue
,
Wei Qiu
,
Bo An
,
Zinovi Rabinovich
,
Svetlana Obraztsova
,
Chai Kiat Yeo
Mis-spoke or mis-lead: Achieving Robustness in Multi-Agent Communicative Reinforcement Learning.
CoRR
(2021)
Wanqi Xue
,
Youzhi Zhang
,
Shuxin Li
,
Xinrun Wang
,
Bo An
,
Chai Kiat Yeo
Solving Large-Scale Extensive-Form Network Security Games via Neural Fictitious Self-Play.
CoRR
(2021)
Shuxin Li
,
Youzhi Zhang
,
Xinrun Wang
,
Wanqi Xue
,
Bo An
CFR-MIX: Solving Imperfect Information Extensive-Form Games with Combinatorial Action Space.
IJCAI
(2021)
Shuxin Li
,
Youzhi Zhang
,
Xinrun Wang
,
Wanqi Xue
,
Bo An
CFR-MIX: Solving Imperfect Information Extensive-Form Games with Combinatorial Action Space.
CoRR
(2021)
Wanqi Xue
,
Wei Wang
One-Shot Image Classification by Learning to Restore Prototypes.
AAAI
(2020)
Wanqi Xue
,
Wei Wang
One-Shot Image Classification by Learning to Restore Prototypes.
CoRR
(2020)
Yi Sen Ng
,
Wanqi Xue
,
Wei Wang
,
Panpan Qi
Convolutional Neural Networks for Food Image Recognition: An Experimental Study.
MADiMa @ ACM Multimedia
(2019)