Keyphrases
- document type
- information retrieval
- document collections
- document structure
- web documents
- document classification
- free text
- semi structured
- text documents
- electronic documents
- relevant documents
- information retrieval systems
- information extraction
- xml documents
- metadata
- plain text
- document clustering
- web pages
- ranked list
- structured documents
- html documents
- retrieval systems
- user queries
- html pages
- legal documents
- web data
- text retrieval
- document set
- multi document summarization
- multimedia documents
- vector space
- xml format
- keywords