Exploiting Cross-Lingual Subword Similarities in Low-Resource Document Classification.
Mozhi ZhangYoshinari FujinumaJordan L. Boyd-GraberPublished in: AAAI (2020)
Keyphrases
- document classification
- cross lingual
- text classification
- n gram
- machine translation
- language modeling
- word alignment
- text mining
- text categorization
- cross language
- web documents
- text documents
- classification algorithm
- similarity measure
- bag of words
- machine learning
- transfer learning
- language model
- knn
- feature selection
- probabilistic model
- statistical machine translation