A High-Quality Multilingual Dataset for Structured Documentation Translation.
Kazuma HashimotoRaffaella BuschiazzoJames BradburyTeresa MarshallRichard SocherCaiming XiongPublished in: WMT (1) (2019)
Keyphrases
- high quality
- cross language information retrieval
- language resources
- machine translation
- cross language
- parallel corpus
- cross lingual information retrieval
- machine translation system
- low quality
- digital libraries
- cross language ir
- cross lingual
- benchmark datasets
- query translation
- high resolution
- language independent
- structured queries
- comparable corpora
- chinese english
- ground truth
- statistical machine translation
- image quality
- feature set
- synthetic datasets
- super resolution
- bilingual dictionaries