A Scalable System for Identifying Co-derivative Documents.
Yaniv BernsteinJustin ZobelPublished in: SPIRE (2004)
Keyphrases
- information retrieval
- information retrieval systems
- document retrieval
- relevant documents
- document collections
- highly scalable
- text documents
- web documents
- legal documents
- xml documents
- document classification
- keywords
- multiscale
- document analysis
- neural network
- lightweight
- digital documents
- document content
- document structure
- time stamped
- expert finding
- metadata
- text analysis
- free text
- web data
- vector space