TANDO: A Corpus for Document-level Machine Translation.
Harritxu GeteThierry EtchegoyhenDavid PonceGorka LabakaNora AranberriAnder CorralXabier SaralegiIgor EllakuriaMaite MartínPublished in: LREC (2022)
Keyphrases
- document level
- lexical cohesion
- machine translation
- language model
- sentiment classification
- sentence level
- cross lingual
- query expansion
- word level
- statistical machine translation
- language independent
- coreference resolution
- document retrieval
- chinese english
- natural language processing
- sentiment analysis
- target language
- pseudo relevance feedback
- information retrieval
- multiword
- retrieval strategies
- information extraction