Incorporating Word Embedding into Cross-Lingual Topic Modeling.
Chia-Hsuan Chang, San-Yih Hwang, Tou-Hsiang Xui
Published in: BigData Congress (2018)
Keyphrases
- cross lingual
- monolingual and cross lingual
- topic modeling
- parallel corpus
- text classification
- word sense
- latent topics
- n gram
- language modeling
- topic models
- machine translation
- language independent
- cross language
- co occurrence
- text mining
- probabilistic topic models
- vector space
- word sense disambiguation
- latent dirichlet allocation
- language model
- text categorization
- text corpora
- transfer learning
- target language
- relevance model
- collaborative filtering
- high dimensional
- machine learning
- document clustering
- text documents
- probabilistic latent semantic analysis
- information retrieval