JParaCrawl v3.0: A Large-scale English-Japanese Parallel Corpus.
Makoto MorishitaKatsuki ChousaJun SuzukiMasaaki NagataPublished in: CoRR (2022)
Keyphrases
- parallel corpus
- cross lingual
- machine translation
- cross language information retrieval
- machine translation system
- language independent
- query translation
- word alignment
- statistical machine translation
- target language
- source language
- natural language
- cross language
- sentence pairs
- information retrieval
- bilingual dictionaries
- document clustering
- test collection
- information retrieval systems
- keywords