Login / Signup
Understanding Parameter Sharing in Transformers.
Ye Lin
Mingxuan Wang
Zhexi Zhang
Xiaohui Wang
Tong Xiao
Jingbo Zhu
Published in:
CoRR (2023)
Keyphrases
</>
parameter values
deeper understanding
data sets
databases
genetic algorithm
data structure
knowledge sharing
optimal parameters