FooTweets: A Bilingual Parallel Corpus of World Cup Tweets.
Henny Sluyter-Gäthje, Pintu Lohar, Haithem Afli, Andy Way
Published in: LREC (2018)
Keyphrases
- source language
- world cup
- parallel corpus
- target language
- machine translation
- soccer game
- query translation
- cross lingual
- cross language information retrieval
- statistical machine translation
- news articles
- sentence pairs
- cross language
- named entities
- machine translation system
- language independent
- user generated content
- word alignment
- multi modal
- robot soccer