​
Login / Signup
Zihan Qiu
Publication Activity (10 Years)
Years Active: 2021-2024
Publications (10 Years): 19
Top Topics
Partially Observable Environments
Language Model
Reinforcement Learning Problems
Actor Critic
Top Venues
CoRR
ChineseCSCW (2)
ACL (1)
Tiny Papers @ ICLR
</>
Publications
</>
Wenyu Du
,
Shuang Cheng
,
Tongxu Luo
,
Zihan Qiu
,
Zeyu Huang
,
Ka Chun Cheung
,
Reynold Cheng
,
Jie Fu
Unlocking Continual Learning Abilities in Language Models.
CoRR
(2024)
Yuemei Xu
,
Ling Hu
,
Jiayi Zhao
,
Zihan Qiu
,
Yuqi Ye
,
Hanwen Gu
A Survey on Multilingual Large Language Models: Corpora, Alignment, and Bias.
CoRR
(2024)
Wenyu Du
,
Tongxu Luo
,
Zihan Qiu
,
Zeyu Huang
,
Yikang Shen
,
Reynold Cheng
,
Yike Guo
,
Jie Fu
Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training.
CoRR
(2024)
Zihan Qiu
,
Zeyu Huang
,
Youcheng Huang
,
Jie Fu
Empirical Study on Updating Key-Value Memories in Transformer Feed-forward Layers.
Tiny Papers @ ICLR
(2024)
Zihan Qiu
,
Zeyu Huang
,
Jie Fu
Unlocking Emergent Modularity in Large Language Models.
NAACL-HLT
(2024)
Haoze Wu
,
Zihan Qiu
,
Zili Wang
,
Hang Zhao
,
Jie Fu
GW-MoE: Resolving Uncertainty in MoE Router with Global Workspace Theory.
CoRR
(2024)
Hao Zhao
,
Zihan Qiu
,
Huijia Wu
,
Zili Wang
,
Zhaofeng He
,
Jie Fu
HyperMoE: Towards Better Mixture of Experts via Transferring Among Experts.
ACL (1)
(2024)
Tao Li
,
Lixing Wang
,
Zihan Qiu
,
Philippe Ciais
,
Taochun Sun
,
Matthew W. Jones
,
Robbie M. Andrew
,
Glen P. Peters
,
Piyu Ke
,
Xiaoting Huang
,
Robert B. Jackson
,
Zhu Liu
Reconstructing Global Daily CO2 Emissions via Machine Learning.
CoRR
(2024)
Hao Zhao
,
Zihan Qiu
,
Huijia Wu
,
Zili Wang
,
Zhaofeng He
,
Jie Fu
HyperMoE: Paying Attention to Unselected Experts in Mixture of Experts via Dynamic Transfer.
CoRR
(2024)
Zihan Qiu
,
Zeyu Huang
,
Youcheng Huang
,
Jie Fu
Empirical Study on Updating Key-Value Memories in Transformer Feed-forward Layers.
CoRR
(2024)
Ka Man Lo
,
Zeyu Huang
,
Zihan Qiu
,
Zili Wang
,
Jie Fu
A Closer Look into Mixture-of-Experts in Large Language Models.
CoRR
(2024)
Zihan Qiu
,
Zeyu Huang
,
Jie Fu
Emergent Mixture-of-Experts: Can Dense Pre-trained Transformers Benefit from Emergent Modular Structures?
CoRR
(2023)
Zihan Qiu
,
Zhen Liu
,
Shuicheng Yan
,
Shanghang Zhang
,
Jie Fu
Heterogenous Memory Augmented Neural Networks.
CoRR
(2023)
Jialong Wu
,
Haixu Wu
,
Zihan Qiu
,
Jianmin Wang
,
Mingsheng Long
Supported Policy Optimization for Offline Reinforcement Learning.
CoRR
(2022)
Jialong Wu
,
Haixu Wu
,
Zihan Qiu
,
Jianmin Wang
,
Mingsheng Long
Supported Policy Optimization for Offline Reinforcement Learning.
NeurIPS
(2022)
Yu Lai
,
Liantao Lan
,
Rui Liang
,
Li Huang
,
Zihan Qiu
,
Yong Tang
A University Portrait System Incorporating Academic Social Network.
ChineseCSCW (2)
(2021)
Yongxu Long
,
Zihan Qiu
,
Dongyang Zheng
,
Zhengyang Wu
,
Jianguo Li
,
Yong Tang
ResConvE: Deeper Convolution-Based Knowledge Graph Embeddings.
ChineseCSCW (2)
(2021)
Zekai Zhou
,
Dongyang Zheng
,
Zihan Qiu
,
Ronghua Lin
,
Zhengyang Wu
,
Chengzhe Yuan
Academic Article Classification Algorithm Based on Pre-trained Model and Keyword Extraction.
ChineseCSCW (2)
(2021)
Zihan Qiu
,
Zekai Zhou
,
Yongxu Long
,
Chang Ji
,
Jianguo Li
,
Yong Tang
Detection of Advertising Users Based on K-SMOTE and Ensemble Learning.
HCC
(2021)