TaTa: A Multilingual Table-to-Text Dataset for African Languages.
Sebastian Gehrmann, Sebastian Ruder, Vitaly Nikolaev, Jan A. Botha, Michael Chavinda, Ankur P. Parikh, Clara Rivera
Published in: CoRR (2022)
Keyphrases
- multi lingual
- language specific
- language independent
- multilingual documents
- cross lingual
- database
- indian languages
- text generation
- language identification
- multilingual information retrieval
- machine translation system
- text retrieval
- english text
- machine translation
- expressive power
- information access
- text summarization
- information retrieval
- natural language
- databases
- native language
- manually constructed
- comparable corpora
- cross lingual information retrieval
- text mining
- plain text
- benchmark datasets
- language resources
- cross language
- arabic language
- natural language generation
- out of vocabulary
- query translation
- text documents
- free text
- language modeling
- cross language information retrieval