Kaiyue Wen
Publication Activity (10 Years)
Years Active: 2022-2024
Publications (10 Years): 14
Top Topics
Deep Learning
Quality Assessment
High Noise
Case Study
Top Venues
CoRR
ICLR
NeurIPS
EMNLP
Publications
Kaiyue Wen, Xingyu Dang, Kaifeng Lyu
RNNs are not Transformers (Yet): The Key Bottleneck on In-context Retrieval.
CoRR (2024)
Haozhe Jiang, Kaiyue Wen, Yilei Chen
Practically Solving LPN in High Noise Regimes Faster Using Neural Networks.
CoRR (2023)
Haozhe Jiang, Kaiyue Wen, Yilei Chen
Practically Solving LPN in High Noise Regimes Faster Using Neural Networks.
IACR Cryptol. ePrint Arch. 2023 (2023)
Kaiyue Wen, Zhiyuan Li, Tengyu Ma
Sharpness Minimization Algorithms Do Not Only Minimize Sharpness To Achieve Better Generalization.
NeurIPS (2023)
Kaiyue Wen, Yuchen Li, Bingbin Liu, Andrej Risteski
Transformers are uninterpretable with myopic methods: a case study with bounded Dyck grammars.
NeurIPS (2023)
Kaiyue Wen, Zhiyuan Li, Tengyu Ma
Sharpness Minimization Algorithms Do Not Only Minimize Sharpness To Achieve Better Generalization.
CoRR (2023)
Kaiyue Wen, Jiaye Teng, Jingzhao Zhang
Benign Overfitting in Classification: Provably Counter Label Noise with Larger Models.
ICLR (2023)
Kaiyue Wen, Tengyu Ma, Zhiyuan Li
How Sharpness-Aware Minimization Minimizes Sharpness?
ICLR (2023)
Kaiyue Wen, Yuchen Li, Bingbin Liu, Andrej Risteski
Transformers are uninterpretable with myopic methods: a case study with bounded Dyck grammars.
CoRR (2023)
Kaiyue Wen, Tengyu Ma, Zhiyuan Li
How Does Sharpness-Aware Minimization Minimize Sharpness?
CoRR (2022)
Kaiyue Wen, Jiaye Teng, Jingzhao Zhang
Realistic Deep Learning May Not Fit Benignly.
CoRR (2022)
Xiaozhi Wang, Kaiyue Wen, Zhengyan Zhang, Lei Hou, Zhiyuan Liu, Juanzi Li
Finding Skill Neurons in Pre-trained Transformer-based Language Models.
EMNLP (2022)
Yusheng Su, Xiaozhi Wang, Yujia Qin, Chi-Min Chan, Yankai Lin, Huadong Wang, Kaiyue Wen, Zhiyuan Liu, Peng Li, Juanzi Li, Lei Hou, Maosong Sun, Jie Zhou
On Transferability of Prompt Tuning for Natural Language Processing.
NAACL-HLT (2022)
Xiaozhi Wang, Kaiyue Wen, Zhengyan Zhang, Lei Hou, Zhiyuan Liu, Juanzi Li
Finding Skill Neurons in Pre-trained Transformer-based Language Models.
CoRR (2022)