On the Prunability of Attention Heads in Multilingual BERT.
Aakriti Budhraja
Madhura Pande
Pratyush Kumar
Mitesh M. Khapra
Published in:
CoRR (2021)
Keyphrases
digital libraries
data sets
neural network
cross language
language independent
social networks
image processing
decision trees
bayesian networks
probabilistic model