InfoCTM: A Mutual Information Maximization Perspective of Cross-Lingual Topic Modeling.
Xiaobao Wu, Xinshuai Dong, Thong Nguyen, Chaoqun Liu, Liangming Pan, Anh Tuan Luu
Published in: CoRR (2023)
Keyphrases
- cross lingual
- monolingual and cross lingual
- topic modeling
- text classification
- probabilistic topic models
- topic models
- machine translation
- language modeling
- language independent
- latent dirichlet allocation
- text mining
- collaborative filtering
- language model
- relevance model
- document clustering
- transfer learning
- text documents
- text categorization
- information retrieval
- data mining
- news articles
- bayesian networks
- feature extraction
- word sense
- text corpora
- latent topics
- artificial intelligence