Better Quality Pre-training Data and T5 Models for African Languages.
Akintunde OladipoMofetoluwa AdeyemiOrevaoghene AhiaAbraham Toluwase OwodunniOdunayo OgundepoDavid Ifeoluwa AdelaniJimmy LinPublished in: EMNLP (2023)
Keyphrases
- training data
- prior knowledge
- databases
- high quality
- classification accuracy
- generalization error
- domain knowledge
- classification models
- computational models
- model selection
- complex systems
- experimental data
- expressive power
- statistical models
- quality prediction
- data sets
- supervised learning
- probabilistic model
- learning algorithm
- neural network