A word embedding-based approach to cross-lingual topic modeling.
Chia-Hsuan ChangSan-Yih HwangPublished in: Knowl. Inf. Syst. (2021)
Keyphrases
- cross lingual
- monolingual and cross lingual
- topic modeling
- parallel corpus
- text classification
- word sense
- latent topics
- topic models
- probabilistic topic models
- machine translation
- n gram
- language modeling
- latent dirichlet allocation
- language independent
- co occurrence
- cross language
- text mining
- bag of words
- word sense disambiguation
- vector space
- transfer learning
- text categorization
- artificial intelligence
- news articles
- document clustering
- machine learning
- generative model
- knowledge discovery
- clustering algorithm