The WAW Corpus: The First Corpus of Interpreted Speeches and their Translations for English and Arabic.
Ahmed AbdelaliIrina P. TemnikovaSamy HedayaStephan VogelPublished in: LREC (2018)
Keyphrases
- link grammar
- open domain
- machine translation
- person names
- unknown words
- machine translation system
- statistical machine translation
- training corpus
- semantic roles
- broad coverage
- multiword
- parallel corpus
- language independent
- english words
- wide coverage
- natural language
- morphological analysis
- english language
- language identification
- pos tagging
- language learning