JParaCrawl v3.0: A Large-scale English-Japanese Parallel Corpus.
Makoto MorishitaKatsuki ChousaJun SuzukiMasaaki NagataPublished in: LREC (2022)
Keyphrases
- parallel corpus
- cross lingual
- machine translation
- cross language information retrieval
- query translation
- machine translation system
- language independent
- statistical machine translation
- word alignment
- target language
- sentence pairs
- cross language
- source language
- semantic space
- text mining
- semi supervised
- knowledge representation