Wei Xiong
Publication Activity (10 Years)
Years Active: 2023-2023
Publications (10 Years): 8
Top Topics
Bayesian Models
Multi Armed Bandits
Reinforcement Learning
Function Approximation
Top Venues
CoRR
ISIT
IEEE Trans. Signal Process.
ICML
Publications
Zhihan Liu, Miao Lu, Wei Xiong, Han Zhong, Hao Hu, Shenao Zhang, Sirui Zheng, Zhuoran Yang, Zhaoran Wang
One Objective to Rule Them All: A Maximization Objective Fusing Estimation and Planning for Exploration.
CoRR (2023)
Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang
Reward Teaching for Federated Multi-armed Bandits.
ISIT (2023)
Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang
Reward Teaching for Federated Multi-armed Bandits.
CoRR (2023)
Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang
Reward Teaching for Federated Multiarmed Bandits.
IEEE Trans. Signal Process. 71 (2023)
Wei Xiong, Han Zhong, Chengshuai Shi, Cong Shen, Liwei Wang, Tong Zhang
Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game.
ICLR (2023)
Shizhe Diao, Rui Pan, Hanze Dong, Kashun Shum, Jipeng Zhang, Wei Xiong, Tong Zhang
LMFlow: An Extensible Toolkit for Finetuning and Inference of Large Foundation Models.
CoRR (2023)
Hanze Dong, Wei Xiong, Deepanshu Goyal, Yihan Zhang, Winnie Chow, Rui Pan, Shizhe Diao, Jipeng Zhang, Kashun Shum, Tong Zhang
RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment.
Trans. Mach. Learn. Res. 2023 (2023)
Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang
Provably Efficient Offline Reinforcement Learning with Perturbed Data Sources.
ICML (2023)