Login / Signup
Sparse Mixers: Combining MoE and Mixing to build a more efficient BERT.
James Lee-Thorp
Joshua Ainslie
Published in:
EMNLP (Findings) (2022)
Keyphrases
</>
three dimensional
data mining
high dimensional
neural network
artificial intelligence
information systems
feature selection
decision making
e learning
image processing
digital libraries
cost effective
combining multiple
sparse data