Norm-based Noisy Corpora Filtering and Refurbishing in Neural Machine Translation.
Yu Lu, Jiajun Zhang
Published in: EMNLP (2022)
Keyphrases
- machine translation
- natural language processing
- Chinese English
- statistical machine translation
- parallel corpus
- parallel corpora
- information extraction
- cross language information retrieval
- machine readable dictionaries
- cross lingual
- comparable corpora
- language independent
- word sense disambiguation
- language processing
- word alignment
- target language
- source language
- Brazilian Portuguese
- linguistic resources
- finite state transducers
- machine translation system
- natural language
- natural language generation
- language resources
- question answering
- word level
- training corpus
- machine learning
- bilingual lexicon
- WordNet