ParaZh-22M: A Large-Scale Chinese Parabank via Machine Translation.
Wenjie HaoHongfei XuDeyi XiongHongying ZanLingling MuPublished in: COLING (2022)
Keyphrases
- machine translation
- chinese english
- english chinese
- language independent
- cross lingual
- language processing
- natural language processing
- phrase based smt
- cross language information retrieval
- target language
- information extraction
- statistical machine translation
- natural language
- machine translation system
- natural language generation
- word segmentation
- language resources
- word sense disambiguation
- query translation
- parallel corpora
- word alignment
- bi directional
- foreign language
- text summarization
- word level
- parallel corpus
- comparable corpora
- wordnet
- text mining
- multilingual documents
- information retrieval