Parameter and Data Efficient Continual Pre-training for Robustness to Dialectal Variance in Arabic.
Soumajyoti SarkarKaixiang LinSailik SenguptaLeonard LausenSheng ZhaSaab MansourPublished in: CoRR (2022)
Keyphrases
- raw data
- training data
- data analysis
- complex data
- data sets
- statistical analysis
- database
- data processing
- synthetic data
- data sources
- data points
- computer systems
- data retrieval
- experimental data
- missing data
- test set
- training samples
- supervised learning
- image data
- knowledge discovery
- prior knowledge
- relational databases
- data structure