​
Login / Signup
Yu Bai
Publication Activity (10 Years)
Years Active: 2022-2024
Publications (10 Years): 21
Top Topics
Function Approximation
Bayes Net
Reinforcement Learning
Markov Games
Top Venues
CoRR
ICLR
NeurIPS
ICML
</>
Publications
</>
Licong Lin
,
Yu Bai
,
Song Mei
Transformers as Decision Makers: Provable In-Context Reinforcement Learning via Supervised Pretraining.
ICLR
(2024)
Tianyu Guo
,
Wei Hu
,
Song Mei
,
Huan Wang
,
Caiming Xiong
,
Silvio Savarese
,
Yu Bai
How Do Transformers Learn In-Context Beyond Simple Functions? A Case Study on Learning with Representations.
ICLR
(2024)
Jiacheng Guo
,
Minshuo Chen
,
Huan Wang
,
Caiming Xiong
,
Mengdi Wang
,
Yu Bai
Sample-Efficient Learning of POMDPs with Multiple Observations In Hindsight.
ICLR
(2024)
Minshuo Chen
,
Yu Bai
,
H. Vincent Poor
,
Mengdi Wang
Efficient RL with Impaired Observability: Learning to Act with Delayed and Missing State Observations.
CoRR
(2023)
Fan Chen
,
Yu Bai
,
Song Mei
Partially Observable RL with B-Stability: Unified Structural Condition and Sharp Sample-Efficient Algorithms.
ICLR
(2023)
Yuanhao Wang
,
Dingwen Kong
,
Yu Bai
,
Chi Jin
Learning Rationalizable Equilibria in Multiplayer Games.
ICLR
(2023)
Tianyu Guo
,
Wei Hu
,
Song Mei
,
Huan Wang
,
Caiming Xiong
,
Silvio Savarese
,
Yu Bai
How Do Transformers Learn In-Context Beyond Simple Functions? A Case Study on Learning with Representations.
CoRR
(2023)
Fan Chen
,
Huan Wang
,
Caiming Xiong
,
Song Mei
,
Yu Bai
Lower Bounds for Learning in Revealing POMDPs.
ICML
(2023)
Hengyu Fu
,
Tianyu Guo
,
Yu Bai
,
Song Mei
What can a Single Attention Layer Learn? A Study Through the Random Features Lens.
CoRR
(2023)
Jiacheng Guo
,
Minshuo Chen
,
Huan Wang
,
Caiming Xiong
,
Mengdi Wang
,
Yu Bai
Sample-Efficient Learning of POMDPs with Multiple Observations In Hindsight.
CoRR
(2023)
Licong Lin
,
Yu Bai
,
Song Mei
Transformers as Decision Makers: Provable In-Context Reinforcement Learning via Supervised Pretraining.
CoRR
(2023)
Yu Bai
,
Fan Chen
,
Huan Wang
,
Caiming Xiong
,
Song Mei
Transformers as Statisticians: Provable In-Context Learning with In-Context Algorithm Selection.
CoRR
(2023)
Aadyot Bhatnagar
,
Huan Wang
,
Caiming Xiong
,
Yu Bai
Improved Online Conformal Prediction via Strongly Adaptive Online Learning.
ICML
(2023)
Hengyu Fu
,
Tianyu Guo
,
Yu Bai
,
Song Mei
What can a Single Attention Layer Learn? A Study Through the Random Features Lens.
NeurIPS
(2023)
Yuanhao Wang
,
Qinghua Liu
,
Yu Bai
,
Chi Jin
Breaking the Curse of Multiagency: Provably Efficient Decentralized Multi-Agent RL with Function Approximation.
COLT
(2023)
Minshuo Chen
,
Yu Bai
,
H. Vincent Poor
,
Mengdi Wang
Efficient RL with Impaired Observability: Learning to Act with Delayed and Missing State Observations.
NeurIPS
(2023)
Tengyang Xie
,
Dylan J. Foster
,
Yu Bai
,
Nan Jiang
,
Sham M. Kakade
The Role of Coverage in Online Reinforcement Learning.
ICLR
(2023)
Runyu Zhang
,
Qinghua Liu
,
Huan Wang
,
Caiming Xiong
,
Na Li
,
Yu Bai
Policy Optimization for Markov Games: Unified Framework and Faster Convergence.
NeurIPS
(2022)
Yu Bai
,
Chi Jin
,
Song Mei
,
Ziang Song
,
Tiancheng Yu
Efficient Phi-Regret Minimization in Extensive-Form Games via Online Mirror Descent.
NeurIPS
(2022)
Ziang Song
,
Song Mei
,
Yu Bai
Sample-Efficient Learning of Correlated Equilibria in Extensive-Form Games.
NeurIPS
(2022)
Eshaan Nichani
,
Yu Bai
,
Jason D. Lee
Identifying good directions to escape the NTK regime and efficiently learn low-degree plus sparse polynomials.
NeurIPS
(2022)