Cross-Lingual Text Classification with Multilingual Distillation and Zero-Shot-Aware Training.
Ziqing YangYiming CuiZhigang ChenShijin WangPublished in: CoRR (2022)
Keyphrases
- cross lingual
- text classification
- text classifiers
- language independent
- cross lingual information retrieval
- cross language
- language modeling
- multi lingual
- text categorization
- bag of words
- feature selection
- text mining
- text documents
- machine learning
- n gram
- labeled data
- knn
- parallel corpora
- training set
- translation model
- monolingual and cross lingual
- supervised learning
- word alignment
- unlabeled data
- parallel corpus
- web news
- language specific
- query translation
- statistical machine translation
- source language
- neural network
- machine translation