Bridging the domain gap in cross-lingual document classification.
Guokun LaiBarlas OguzYiming YangVeselin StoyanovPublished in: CoRR (2019)
Keyphrases
- document classification
- cross lingual
- text classification
- word alignment
- text categorization
- text mining
- machine translation
- transfer learning
- text documents
- classification algorithm
- language modeling
- web documents
- document clustering
- news articles
- machine learning
- bag of words
- labeled data
- information retrieval
- natural language processing
- probabilistic model
- digital libraries