Measuring the Complexity of a Collection of Documents.
Vishwa VinayIngemar J. CoxNatasa Milic-FraylingKenneth R. WoodPublished in: ECIR (2006)
Keyphrases
- document collections
- automatic categorization
- document retrieval
- information retrieval systems
- distributed information retrieval
- database
- information retrieval
- document repositories
- computational complexity
- text collections
- xml documents
- keywords
- retrieval systems
- text documents
- document clustering
- worst case
- document classification
- document set
- time stamped
- free text
- metadata
- legal documents
- document analysis
- text retrieval
- relevant documents
- user queries
- web documents