LocMoE+: Enhanced Router with Token Feature Awareness for Efficient LLM Pre-Training.
Jing Li
Zhijie Sun
Dachao Lin
Xuan He
Yi Lin
Binfan Zheng
Li Zeng
Rongqian Zhao
Xin Chen
Published in:
CoRR (2024)
Keyphrases
feature vectors
training set
cost effective
end to end
machine learning
decision trees
case study
object recognition
artificial neural networks
training process