Lifting the Curse of Multilinguality by Pre-training Modular Transformers.
Jonas Pfeiffer, Naman Goyal, Xi Victoria Lin, Xian Li, James Cross, Sebastian Riedel, Mikel Artetxe
Published in: NAACL-HLT (2022)
Keyphrases
- training process
- training set
- wavelet transform
- real time
- dimension reduction
- test set
- training samples
- dimensionality reduction
- supervised learning
- high dimensional
- expert systems
- data sets
- database
- multiresolution
- feature extraction
- training examples
- real world
- high dimensionality
- training algorithm
- multi-layer perceptron