A Multilingual Multiway Evaluation Data Set for Structured Document Translation of Asian Languages.
Bianka Buschbeck, Raj Dabre, Miriam Exel, Matthias Huck, Patrick Huy, Raphael Rubino, Hideki Tanaka
Published in: AACL/IJCNLP (Findings) (2022)
Keyphrases
- structured documents
- data sets
- language resources
- cross lingual information retrieval
- language independent
- cross lingual
- machine translation
- comparable corpora
- cross language
- machine translation system
- cross language information retrieval
- query translation
- bilingual dictionaries
- information retrieval systems
- statistical machine translation
- target language
- databases
- structured document retrieval
- parallel corpora
- parallel corpus
- database
- xml documents
- training set
- multilingual information retrieval
- retrieval systems
- language specific
- metadata