Bayesian multilingual topic model for zero-shot cross-lingual topic identification.
Santosh KesirajuSangeet SagarOndrej GlembekLukás BurgetSuryakanth V. GangashettyPublished in: CoRR (2020)
Keyphrases
- cross lingual
- topic models
- latent dirichlet allocation
- topic modeling
- language modeling
- cross lingual information retrieval
- machine translation
- cross language
- probabilistic model
- text classification
- language independent
- latent topics
- topic discovery
- co occurrence
- text documents
- text mining
- generative model
- latent variables
- parallel corpus
- language specific
- news articles
- bayesian networks
- bayesian inference
- probabilistic topic models
- document clustering
- language model
- transfer learning
- word sense
- machine learning
- bag of words
- word alignment
- text streams