IRLCov19: A Large COVID-19 Multilingual Twitter Dataset of Indian Regional Languages.
Deepak Uniyal, Amit Agarwal
Published in: PKDD/ECML Workshops (2) (2021)
Keyphrases
- cross lingual
- language independent
- multi lingual
- multilingual information retrieval
- social media
- multilingual documents
- language specific
- social networks
- language resources
- benchmark datasets
- machine translation
- database
- digital libraries
- machine translation system
- comparable corpora
- bilingual dictionaries
- natural language
- social networking
- cross lingual information retrieval
- expressive power
- text summarization
- language identification
- cross language information retrieval
- Indian languages
- news articles