Annotation-aware web clustering based on topic model and random walks.
Jiashen SunXiaojie WangCaixia YuanGuannan FangPublished in: CCIS (2011)
Keyphrases
- random walk
- topic models
- latent dirichlet allocation
- topic modeling
- web graph
- text documents
- probabilistic model
- web mining
- latent variables
- text mining
- co occurrence
- transition probabilities
- web data
- markov chain
- active learning
- latent topics
- link analysis
- link spam
- microblog posts
- link prediction
- web documents
- web pages
- generative model
- probabilistic topic models
- information extraction
- pairwise
- machine learning