Constructing the CODA Corpus: A Parallel Corpus of Monologues and Expository Dialogues.
Svetlana Stoyanchev, Paul Piwek
Published in: LREC (2010)
Keyphrases
- parallel corpus
- cross lingual
- cross language information retrieval
- query translation
- sentence pairs
- machine translation
- language independent
- statistical machine translation
- machine translation system
- word alignment
- parallel texts
- language modeling
- feature selection
- target language
- dialogue system
- natural language processing
- digital libraries