Sign in
Shwai He
Publication Activity (10 Years)
Years Active: 2021-2024
Publications (10 Years): 16
Top Topics
Machine Translation
Computational Efficiency
Expert Advice
Arbitrary Topology
Top Venues
CoRR
EMNLP
ACL (1)
WMT
</>
Publications
</>
Ming Li
,
Lichang Chen
,
Jiuhai Chen
,
Shwai He
,
Jiuxiang Gu
,
Tianyi Zhou
Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning.
CoRR
(2024)
Ming Li
,
Yong Zhang
,
Shwai He
,
Zhitao Li
,
Hongyu Zhao
,
Jianzong Wang
,
Ning Cheng
,
Tianyi Zhou
Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning.
CoRR
(2024)
Shwai He
,
Run-Ze Fan
,
Liang Ding
,
Li Shen
,
Tianyi Zhou
,
Dacheng Tao
MerA: Merging Pretrained Adapters For Few-Shot Learning.
CoRR
(2023)
Shwai He
,
Run-Ze Fan
,
Liang Ding
,
Li Shen
,
Tianyi Zhou
,
Dacheng Tao
Merging Experts into One: Improving Computational Efficiency of Mixture of Experts.
EMNLP
(2023)
Chenbo Jiang
,
Jie Yang
,
Shwai He
,
Yu-Kun Lai
,
Lin Gao
NeuralSlice: Neural 3D Triangle Mesh Reconstruction via Slicing 4D Tetrahedral Meshes.
ICML
(2023)
Shwai He
,
Chenbo Jiang
,
Daize Dong
,
Liang Ding
SD-Conv: Towards the Parameter-Efficiency of Dynamic Convolution.
WACV
(2023)
Ming Li
,
Lichang Chen
,
Jiuhai Chen
,
Shwai He
,
Heng Huang
,
Jiuxiang Gu
,
Tianyi Zhou
Reflection-Tuning: Data Recycling Improves LLM Instruction-Tuning.
CoRR
(2023)
Shwai He
,
Run-Ze Fan
,
Liang Ding
,
Li Shen
,
Tianyi Zhou
,
Dacheng Tao
Merging Experts into One: Improving Computational Efficiency of Mixture of Experts.
CoRR
(2023)
Shwai He
,
Liang Ding
,
Daize Dong
,
Boan Liu
,
Fuqiang Yu
,
Dacheng Tao
PAD-Net: An Efficient Framework for Dynamic Networks.
ACL (1)
(2023)
Shwai He
,
Liang Ding
,
Daize Dong
,
Boan Liu
,
Fuqiang Yu
,
Dacheng Tao
Cherry Hypothesis: Identifying the Cherry on the Cake for Dynamic Networks.
CoRR
(2022)
Changtong Zan
,
Keqin Peng
,
Liang Ding
,
Baopu Qiu
,
Boan Liu
,
Shwai He
,
Qingyu Lu
,
Zheng Zhang
,
Chuang Liu
,
Weifeng Liu
,
Yibing Zhan
,
Dacheng Tao
Vega-MT: The JD Explore Academy Translation System for WMT22.
CoRR
(2022)
Shwai He
,
Yuhang Li
,
Chenbo Jiang
,
Shi Gu
When Sparsity Meets Dynamic Convolution.
CoRR
(2022)
Shwai He
,
Liang Ding
,
Daize Dong
,
Miao Zhang
,
Dacheng Tao
SparseAdapter: An Easy Approach for Improving the Parameter-Efficiency of Adapters.
CoRR
(2022)
Changtong Zan
,
Keqin Peng
,
Liang Ding
,
Baopu Qiu
,
Boan Liu
,
Shwai He
,
Qingyu Lu
,
Zheng Zhang
,
Chuang Liu
,
Weifeng Liu
,
Yibing Zhan
,
Dacheng Tao
Vega-MT: The JD Explore Academy Machine Translation System for WMT22.
WMT
(2022)
Shwai He
,
Liang Ding
,
Daize Dong
,
Jeremy Zhang
,
Dacheng Tao
SparseAdapter: An Easy Approach for Improving the Parameter-Efficiency of Adapters.
EMNLP (Findings)
(2022)
Shwai He
,
Shi Gu
Multi-modal Attention Network for Stock Movements Prediction.
CoRR
(2021)