TaTA: A Multilingual Table-to-Text Dataset for African Languages.
Sebastian GehrmannSebastian RuderVitaly NikolaevJan A. BothaMichael ChavindaAnkur P. ParikhClara RiveraPublished in: EMNLP (Findings) (2023)
Keyphrases
- multi lingual
- language specific
- language independent
- multilingual documents
- cross lingual
- text generation
- database
- english text
- text summarization
- machine translation system
- information access
- text retrieval
- indian languages
- native language
- multilingual information retrieval
- machine translation
- information retrieval
- language identification
- databases
- manually constructed
- language resources
- free text
- digital libraries
- plain text
- n gram
- text corpora
- expressive power
- arabic language
- cross language
- parallel corpus
- linguistic resources
- keywords
- relational databases
- cross lingual information retrieval