Seed-Guided Topic Discovery with Out-of-Vocabulary Seeds.
Yu ZhangYu MengXuan WangSheng WangJiawei HanPublished in: CoRR (2022)
Keyphrases
- topic discovery
- out of vocabulary
- text analysis
- language model
- text classification
- n gram
- word segmentation
- named entity recognition
- topic models
- latent dirichlet allocation
- cross language information retrieval
- broadcast news
- parallel corpora
- cross lingual
- hand crafted
- natural language processing
- query terms
- information extraction
- named entities
- language modeling
- text mining
- query translation
- document retrieval
- expectation maximization
- retrieval model
- data mining
- text categorization
- data analysis
- feature selection