Login / Signup
Bo Dai
Publication Activity (10 Years)
Years Active: 2023-2024
Publications (10 Years): 26
Top Topics
Utility Maximization
Policy Gradient Methods
Reinforcement Learning
Diffusion Models
Top Venues
CoRR
ICLR
NeurIPS
CDC
</>
Publications
</>
Hanjun Dai
,
Bethany Wang
,
Xingchen Wan
,
Bo Dai
,
Sherry Yang
,
Azade Nova
,
Pengcheng Yin
,
Phitchaya Mangpo Phothilimthana
,
Charles Sutton
,
Dale Schuurmans
UQE: A Query Engine for Unstructured Databases.
CoRR
(2024)
Fengdi Che
,
Chenjun Xiao
,
Jincheng Mei
,
Bo Dai
,
Ramki Gummadi
,
Oscar A Ramirez
,
Christopher K. Harris
,
A. Rupam Mahmood
,
Dale Schuurmans
Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation.
CoRR
(2024)
Dmitry Shribak
,
Chen-Xiao Gao
,
Yitong Li
,
Chenjun Xiao
,
Bo Dai
Diffusion Spectral Representation for Reinforcement Learning.
CoRR
(2024)
Jincheng Mei
,
Zixin Zhong
,
Bo Dai
,
Alekh Agarwal
,
Csaba Szepesvári
,
Dale Schuurmans
Stochastic Gradient Succeeds for Bandits.
CoRR
(2024)
Tongzheng Ren
,
Haotian Sun
,
Antoine Moulin
,
Arthur Gretton
,
Bo Dai
Spectral Representation for Causal Estimation with Hidden Confounders.
CoRR
(2024)
Yang Hu
,
Haitong Ma
,
Bo Dai
,
Na Li
Efficient Duple Perturbation Robustness in Low-rank MDPs.
CoRR
(2024)
Haitong Ma
,
Zhaolin Ren
,
Bo Dai
,
Na Li
Skill Transfer and Discovery for Sim-to-Real Learning: A Representation-Based Viewpoint.
CoRR
(2024)
Yuchen Zhuang
,
Haotian Sun
,
Yue Yu
,
Rushi Qiang
,
Qifan Wang
,
Chao Zhang
,
Bo Dai
HYDRA: Model Factorization Framework for Black-Box LLM Personalization.
CoRR
(2024)
Shicong Cen
,
Jincheng Mei
,
Hanjun Dai
,
Dale Schuurmans
,
Yuejie Chi
,
Bo Dai
Beyond Expectations: Learning with Stochastic Dominance Made Practical.
CoRR
(2024)
Sherry Yang
,
Yilun Du
,
Bo Dai
,
Dale Schuurmans
,
Joshua B. Tenenbaum
,
Pieter Abbeel
Probabilistic Adaptation of Black-Box Text-to-Video Models.
ICLR
(2024)
Haotian Sun
,
Yuchen Zhuang
,
Wei Wei
,
Chao Zhang
,
Bo Dai
BBox-Adapter: Lightweight Adapting for Black-Box Large Language Models.
CoRR
(2024)
Shicong Cen
,
Jincheng Mei
,
Katayoon Goshvadi
,
Hanjun Dai
,
Tong Yang
,
Sherry Yang
,
Dale Schuurmans
,
Yuejie Chi
,
Bo Dai
Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF.
CoRR
(2024)
Jiayi Chen
,
Hanjun Dai
,
Bo Dai
,
Aidong Zhang
,
Wei Wei
On Task-personalized Multimodal Few-shot Learning for Visually-rich Document Entity Retrieval.
EMNLP (Findings)
(2023)
Tongzheng Ren
,
Tianjun Zhang
,
Lisa Lee
,
Joseph E. Gonzalez
,
Dale Schuurmans
,
Bo Dai
Spectral Decomposition Representation for Reinforcement Learning.
ICLR
(2023)
Tianjun Zhang
,
Tongzheng Ren
,
Chenjun Xiao
,
Wenli Xiao
,
Joseph E. Gonzalez
,
Dale Schuurmans
,
Bo Dai
Energy-based Predictive Representations for Partially Observed Reinforcement Learning.
UAI
(2023)
Jiayi Chen
,
Hanjun Dai
,
Bo Dai
,
Aidong Zhang
,
Wei Wei
On Task-personalized Multimodal Few-shot Learning for Visually-rich Document Entity Retrieval.
CoRR
(2023)
Tongzheng Ren
,
Chenjun Xiao
,
Tianjun Zhang
,
Na Li
,
Zhaoran Wang
,
Sujay Sanghavi
,
Dale Schuurmans
,
Bo Dai
Latent Variable Representation for Reinforcement Learning.
ICLR
(2023)
Jincheng Mei
,
Bo Dai
,
Alekh Agarwal
,
Mohammad Ghavamzadeh
,
Csaba Szepesvári
,
Dale Schuurmans
Ordering-based Conditions for Global Convergence of Policy Gradient Methods.
NeurIPS
(2023)
Haoran Sun
,
Lijun Yu
,
Bo Dai
,
Dale Schuurmans
,
Hanjun Dai
Score-based Continuous-time Discrete Diffusion Models.
ICLR
(2023)
Yilun Du
,
Sherry Yang
,
Bo Dai
,
Hanjun Dai
,
Ofir Nachum
,
Josh Tenenbaum
,
Dale Schuurmans
,
Pieter Abbeel
Learning Universal Policies via Text-Guided Video Generation.
NeurIPS
(2023)
Lingkai Kong
,
Wenhao Mu
,
Jiaming Cui
,
Yuchen Zhuang
,
B. Aditya Prakash
,
Bo Dai
,
Chao Zhang
DF2: Distribution-Free Decision-Focused Learning.
CoRR
(2023)
Jincheng Mei
,
Zixin Zhong
,
Bo Dai
,
Alekh Agarwal
,
Csaba Szepesvári
,
Dale Schuurmans
Stochastic Gradient Succeeds for Bandits.
ICML
(2023)
Tongzheng Ren
,
Zhaolin Ren
,
Na Li
,
Bo Dai
Stochastic Nonlinear Control via Finite-dimensional Spectral Dynamic Embedding.
CDC
(2023)
Haoran Sun
,
Bo Dai
,
Charles Sutton
,
Dale Schuurmans
,
Hanjun Dai
Any-scale Balanced Samplers for Discrete Space.
ICLR
(2023)
Hongming Zhang
,
Tongzheng Ren
,
Chenjun Xiao
,
Dale Schuurmans
,
Bo Dai
Provable Representation with Efficient Planning for Partially Observable Reinforcement Learning.
CoRR
(2023)
Haotian Sun
,
Yuchen Zhuang
,
Lingkai Kong
,
Bo Dai
,
Chao Zhang
AdaPlanner: Adaptive Planning from Feedback with Language Models.
NeurIPS
(2023)