AraMUS: Pushing the Limits of Data and Model Scale for Arabic Natural Language Processing.
Asaad AlghamdiXinyu DuanWei JiangZhenhai WangYimeng WuQingrong XiaZhefeng WangYi ZhengMehdi RezagholizadehBaoxing HuaiPeilun ChengAbbas GhaddarPublished in: CoRR (2023)
Keyphrases
- experimental data
- data sets
- high level
- empirical data
- data analysis
- computational model
- input data
- simulation data
- test data
- training data
- data structure
- mathematical model
- data collection
- data sources
- global scale
- end users
- machine learning
- data processing
- natural language processing
- synthetic data
- statistical analysis
- sensor data
- database
- high quality
- data quality
- knowledge discovery
- probability distribution