Improving Web Page Classification by Integrating Neighboring Pages via a Topic Model.
Wongkot SriuraiPhayung MeesadChoochart HaruechaiyasakPublished in: IICS (2010)
Keyphrases
- topic models
- web page classification
- web pages
- topic modeling
- anchor text
- latent dirichlet allocation
- text classification
- web mining
- text mining
- automatic classification
- latent topics
- search engine
- text documents
- probabilistic model
- generative model
- feature selection
- co occurrence
- web users
- web data
- web documents
- keywords
- link analysis
- databases
- markov chain
- bayesian networks
- real world