THESUS: Organizing Web document collections based on link semantics.
Maria Halkidi, Benjamin Nguyen, Iraklis Varlamis, Michalis Vazirgiannis
Published in: VLDB J. (2003)
Keyphrases
- document collections
- document retrieval
- information retrieval
- information retrieval systems
- text retrieval
- ad hoc retrieval
- web documents
- test collection
- automatic document classification
- document representation
- scatter gather
- document clustering
- data collections
- web users
- relevant documents
- digital libraries
- web mining
- semantic information
- text data
- text collections
- geographic information retrieval
- web pages
- document archives
- cross language
- link analysis
- document clusters
- information access
- topic detection
- anchor text
- xml retrieval
- web graph
- database
- machine learning
- databases