Topical Crawler based on multi-level vector space model and optimized hyperlink chosen strategy.
Yang Xu, Sui Ai-na, Tang Zhan-kun
Published in: IEEE ICCI (2010)
Keyphrases
- vector space model
- web documents
- topic specific
- web pages
- information retrieval
- semantic similarity
- search engine
- index terms
- vector space
- language model
- retrieval model
- keywords
- semantic information
- tf idf
- document clustering
- document representation
- website
- latent semantic indexing
- web users
- feature selection
- text classification
- topic modeling
- link analysis
- principal component analysis
- text documents
- similarity search