AraMUS: Pushing the Limits of Data and Model Scale for Arabic Natural Language Processing.
Asaad Alghamdi, Xinyu Duan, Wei Jiang, Zhenhai Wang, Yimeng Wu, Qingrong Xia, Zhefeng Wang, Yi Zheng, Mehdi Rezagholizadeh, Baoxing Huai, Peilun Cheng, Abbas Ghaddar
Published in: ACL (Findings) (2023)
Keyphrases
- prior knowledge
- simulation data
- natural language processing
- data sets
- experimental data
- data processing
- synthetic data
- input data
- probability distribution
- database
- data collection
- data analysis
- high quality
- probabilistic model
- sensor data
- mathematical model
- measured data
- prior information
- test data
- data quality
- computational model
- scale space
- high level
- learning algorithm
- data sources
- artificial intelligence
- training data
- information extraction
- machine learning
- data structure
- multiscale