Mega-COV: A Billion-Scale Dataset of 100+ Languages for COVID-19.
Muhammad Abdul-Mageed, AbdelRahim A. Elmadany, El Moatez Billah Nagoudi, Dinesh Pabbi, Kunal Verma, Rannie Lin
Published in: EACL (2021)
Keyphrases
- benchmark datasets
- scale space
- database
- language identification
- neural network
- expressive power
- language independent
- databases
- human actions
- small scale
- text summarization
- description languages
- grammatical inference
- target language
- training dataset
- synthetic datasets
- feature space
- information systems
- information retrieval
- data sets