A Benchmark Dataset for Multi-Level Complexity-Controllable Machine Translation.
Kazuki TaniRyoya YuasaKazuki TakikawaAkihiro TamuraTomoyuki KajiwaraTakashi NinomiyaTsuneo KatoPublished in: LREC (2022)
Keyphrases
- machine translation
- benchmark datasets
- language independent
- cross lingual
- natural language processing
- language processing
- information extraction
- natural language generation
- natural language
- cross language information retrieval
- word sense disambiguation
- machine translation system
- word alignment
- target language
- chinese english
- multilingual documents
- language resources
- word level
- statistical machine translation
- machine transliteration
- brazilian portuguese
- data mining
- tasks in natural language processing
- parallel corpora
- information retrieval systems
- search engine