Ensemble Machine Translation to Filter Low Quality Corpus.
Wuying LiuLin WangPublished in: IALP (2022)
Keyphrases
- low quality
- machine translation
- statistical machine translation
- high quality
- parallel corpora
- machine translation system
- chinese english
- parallel corpus
- pos tagging
- natural language processing
- cross lingual
- information extraction
- language independent
- cross language information retrieval
- target language
- word alignment
- comparable corpora
- brazilian portuguese
- word sense disambiguation
- query translation
- natural language
- artificial intelligence
- language resources
- training set
- noun phrases
- distance measure
- language model
- feature vectors
- machine learning