Yaqi Duan
Publication Activity (10 Years)
Years Active: 2018-2024
Publications (10 Years): 28
Top Topics
TD Learning
Reinforcement Learning
Policy Evaluation
Markov Chain
Top Venues
CoRR
ICML
NeurIPS
L4DC
Publications
Zitian Huo, Yaqi Duan, Dongdong Zhan, Xizhen Xu, Nairen Zheng, Jing Cai, Ruifang Sun, Jianping Wang, Fang Cheng, Zhan Gao, Caixia Xu, Wanlin Liu, Yuting Dong, Sailong Ma, Qian Zhang, Yiyun Zheng, Liping Lou, Dong Kuang, Qian Chu, Jun Qin, Guoping Wang, Yi Wang
Proteomic Stratification of Prognosis and Treatment Options for Small Cell Lung Cancer.
Genom. Proteom. Bioinform.
22 (1) (2024)
Yaqi Duan, Martin J. Wainwright
Taming "data-hungry" reinforcement learning? Stability in continuous state-action spaces.
CoRR
(2024)
Yaqi Duan, Martin J. Wainwright
A finite-sample analysis of multi-step temporal difference estimates.
L4DC
(2023)
Chengzhuo Ni, Yaqi Duan, Munther Dahleh, Mengdi Wang, Anru R. Zhang
Learning Good State and Action Representations for Markov Decision Process via Tensor Decomposition.
J. Mach. Learn. Res.
24 (2023)
Aihua Mao, Zihui Du, Junhui Hou, Yaqi Duan, Yong-Jin Liu, Ying He
PU-Flow: A Point Cloud Upsampling Network With Normalizing Flows.
IEEE Trans. Vis. Comput. Graph.
29 (12) (2023)
Aihua Mao, Yaqi Duan, Yu-Hui Wen, Zihui Du, Hongmin Cai, Yong-Jin Liu
Invertible Residual Neural Networks with Conditional Injector and Interpolator for Point Cloud Upsampling.
IJCAI
(2023)
Ming Yin, Yaqi Duan, Mengdi Wang, Yu-Xiang Wang
Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism.
ICLR
(2022)
Yaqi Duan, Kaizheng Wang
Adaptive and Robust Multi-task Learning.
CoRR
(2022)
Yaqi Duan, Jinglong Chen, Tianci Zhang, Shuilong He, Yong Feng, Jingsong Xie, Wenrong Xiao
High-temperature augmented neighborhood metric learning for cross-domain fault diagnosis with imbalanced data.
Knowl. Based Syst.
257 (2022)
Ming Yin, Yaqi Duan, Mengdi Wang, Yu-Xiang Wang
Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism.
CoRR
(2022)
Yaqi Duan, Martin J. Wainwright
Policy evaluation from a single path: Multi-step methods, mixing and mis-specification.
CoRR
(2022)
Yaqi Duan, Chi Jin, Zhiyuan Li
Risk Bounds and Rademacher Complexity in Batch Reinforcement Learning.
CoRR
(2021)
Chengzhuo Ni, Anru R. Zhang, Yaqi Duan, Mengdi Wang
Learning Good State and Action Representations via Tensor Decomposition.
ISIT
(2021)
Yaqi Duan, Chi Jin, Zhiyuan Li
Risk Bounds and Rademacher Complexity in Batch Reinforcement Learning.
ICML
(2021)
Botao Hao, Xiang Ji, Yaqi Duan, Hao Lu, Csaba Szepesvári, Mengdi Wang
Bootstrapping Statistical Inference for Off-Policy Evaluation.
CoRR
(2021)
Chengzhuo Ni, Anru Zhang, Yaqi Duan, Mengdi Wang
Learning Good State and Action Representations via Tensor Decomposition.
CoRR
(2021)
Botao Hao, Yaqi Duan, Tor Lattimore, Csaba Szepesvári, Mengdi Wang
Sparse Feature Selection Makes Batch Reinforcement Learning More Sample Efficient.
ICML
(2021)
Yaqi Duan, Mengdi Wang, Martin J. Wainwright
Optimal policy evaluation using kernel-based temporal difference methods.
CoRR
(2021)
Aihua Mao, Zihui Du, Junhui Hou, Yaqi Duan, Yong-Jin Liu, Ying He
PU-Flow: a Point Cloud Upsampling Network with Normalizing Flows.
CoRR
(2021)
Botao Hao, Xiang Ji, Yaqi Duan, Hao Lu, Csaba Szepesvári, Mengdi Wang
Bootstrapping Fitted Q-Evaluation for Off-Policy Inference.
ICML
(2021)
Yaqi Duan, Zeyu Jia, Mengdi Wang
Minimax-Optimal Off-Policy Evaluation with Linear Function Approximation.
ICML
(2020)
Yaqi Duan, Mengdi Wang
Minimax-Optimal Off-Policy Evaluation with Linear Function Approximation.
CoRR
(2020)
Botao Hao, Yaqi Duan, Tor Lattimore, Csaba Szepesvári, Mengdi Wang
Sparse Feature Selection Makes Batch Reinforcement Learning More Sample Efficient.
CoRR
(2020)
Yaqi Duan, Mengdi Wang, Zaiwen Wen, Yaxiang Yuan
Adaptive Low-Nonnegative-Rank Approximation for State Aggregation of Markov Chains.
SIAM J. Matrix Anal. Appl.
41 (1) (2020)
Yifan Sun, Yaqi Duan, Hao Gong, Mengdi Wang
Learning low-dimensional state embeddings and metastable clusters from time series data.
NeurIPS
(2019)
Yaqi Duan, Zheng Tracy Ke, Mengdi Wang
State Aggregation Learning from Markov Transition Data.
NeurIPS
(2019)
Yifan Sun, Yaqi Duan, Hao Gong, Mengdi Wang
Learning low-dimensional state embeddings and metastable clusters from time series data.
CoRR
(2019)
Yaqi Duan, Zheng Tracy Ke, Mengdi Wang
State Aggregation Learning from Markov Transition Data.
CoRR
(2018)