Improvement in TF-IDF scheme for Web pages based on the contents of their hyperlinked neighboring pages.
Kazunari Sugiyama, Kenji Hatano, Masatoshi Yoshikawa, Shunsuke Uemura
Published in: Systems and Computers in Japan (2005)
Keyphrases
- tf idf
- web pages
- search engine
- website
- weighting scheme
- vector space model
- html pages
- web content
- web documents
- information retrieval
- text documents
- keywords
- link structure
- term frequency
- web search
- ranking algorithm
- document clustering
- retrieval model
- web graph
- text categorization
- term weighting
- page contents
- web search engines
- anchor text
- metadata
- link analysis
- query expansion
- nearest neighbor