Login / Signup
Qiwei Di
Publication Activity (10 Years)
Years Active: 2023-2024
Publications (10 Years): 7
Top Topics
Partially Observable Markov Decision Processes
Reinforcement Learning
Levenberg Marquardt
Regret Bounds
Top Venues
CoRR
ICLR
ICML
</>
Publications
</>
Qiwei Di
,
Jiafan He
,
Dongruo Zhou
,
Quanquan Gu
Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path.
CoRR
(2024)
Qiwei Di
,
Tao Jin
,
Yue Wu
,
Heyang Zhao
,
Farzad Farnoud
,
Quanquan Gu
Variance-aware Regret Bounds for Stochastic Contextual Dueling Bandits.
ICLR
(2024)
Qiwei Di
,
Jiafan He
,
Quanquan Gu
Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback.
CoRR
(2024)
Qiwei Di
,
Heyang Zhao
,
Jiafan He
,
Quanquan Gu
Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning.
ICLR
(2024)
Qiwei Di
,
Heyang Zhao
,
Jiafan He
,
Quanquan Gu
Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning.
CoRR
(2023)
Qiwei Di
,
Tao Jin
,
Yue Wu
,
Heyang Zhao
,
Farzad Farnoud
,
Quanquan Gu
Variance-Aware Regret Bounds for Stochastic Contextual Dueling Bandits.
CoRR
(2023)
Qiwei Di
,
Jiafan He
,
Dongruo Zhou
,
Quanquan Gu
Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path.
ICML
(2023)