Tag-Weighted Topic Model For Large-scale Semi-Structured Documents.
Shuangyin Li, Jiefei Li, Guan Huang, Ruiyang Tan, Rong Pan
Published in: CoRR (2015)
Keyphrases
- topic models
- semi-structured documents
- latent dirichlet allocation
- topic modeling
- free text
- semi-structured
- probabilistic model
- latent topics
- generative model
- text mining
- XML schema
- text documents
- co-occurrence
- XML documents
- latent variables
- real world
- baseline models
- latent topic model
- information extraction
- LDA model
- probabilistic topic models
- data integration
- collaborative filtering
- knowledge discovery
- query language
- data model
- query processing
- support vector
- keywords
- search engine
- database