scb-mt-en-th-2020: A Large English-Thai Parallel Corpus.
Lalita Lowphansirikul, Charin Polpanumas, Attapol T. Rutherford, Sarana Nutanong
Published in: CoRR (2020)
Keyphrases
- parallel corpus
- machine translation
- cross lingual
- query translation
- word alignment
- cross language information retrieval
- machine translation system
- target language
- language independent
- statistical machine translation
- word segmentation
- source language
- sentence pairs
- information extraction
- parallel corpora
- cross language
- word sense disambiguation
- natural language processing
- natural language
- word level
- language model
- knowledge representation
- active learning
- machine learning