​
Login / Signup
Yan Dai
Publication Activity (10 Years)
Years Active: 2022-2024
Publications (10 Years): 13
Top Topics
Policy Iteration
Function Approximation
Reinforcement Learning Algorithms
E Learning
Top Venues
CoRR
ICML
NeurIPS
ICLR
</>
Publications
</>
Yan Dai
,
Qiwen Cui
,
Simon S. Du
Refined Sample Complexity for Markov Games with Independent Linear Function Approximation (Extended Abstract).
COLT
(2024)
Kwangjun Ahn
,
Zhiyu Zhang
,
Yunbum Kook
,
Yan Dai
Understanding Adam Optimizer via Online Learning of Updates: Adam is FTRL in Disguise.
CoRR
(2024)
Yan Dai
,
Qiwen Cui
,
Simon S. Du
Refined Sample Complexity for Markov Games with Independent Linear Function Approximation.
CoRR
(2024)
Jiatai Huang
,
Yan Dai
,
Longbo Huang
Banker Online Mirror Descent: A Universal Approach for Delayed Online Bandit Learning.
ICML
(2023)
Yan Dai
,
Haipeng Luo
,
Chen-Yu Wei
,
Julian Zimmert
Refined Regret for Adversarial MDPs with Linear Function Approximation.
ICML
(2023)
Yan Dai
,
Ruosong Wang
,
Simon Shaolei Du
Variance-Aware Sparse Linear Bandits.
ICLR
(2023)
Yan Dai
,
Haipeng Luo
,
Chen-Yu Wei
,
Julian Zimmert
Refined Regret for Adversarial MDPs with Linear Function Approximation.
CoRR
(2023)
Yan Dai
,
Kwangjun Ahn
,
Suvrit Sra
The Crucial Role of Normalization in Sharpness-Aware Minimization.
NeurIPS
(2023)
Jiatai Huang
,
Yan Dai
,
Longbo Huang
Banker Online Mirror Descent: A Universal Approach for Delayed Online Bandit Learning.
CoRR
(2023)
Jiatai Huang
,
Yan Dai
,
Longbo Huang
Adaptive Best-of-Both-Worlds Algorithm for Heavy-Tailed Multi-Armed Bandits.
ICML
(2022)
Yan Dai
,
Ruosong Wang
,
Simon S. Du
Variance-Aware Sparse Linear Bandits.
CoRR
(2022)
Yan Dai
,
Haipeng Luo
,
Liyu Chen
Follow-the-Perturbed-Leader for Adversarial Markov Decision Processes with Bandit Feedback.
CoRR
(2022)
Yan Dai
,
Haipeng Luo
,
Liyu Chen
Follow-the-Perturbed-Leader for Adversarial Markov Decision Processes with Bandit Feedback.
NeurIPS
(2022)