Keyphrases
- document collections
- web documents
- information retrieval
- document type
- document classification
- document structure
- information retrieval systems
- plain text
- database
- document retrieval
- metadata
- relevant documents
- XML documents
- HTML documents
- information extraction
- HTML pages
- web pages
- semi-structured
- website
- textual content
- free text
- multimedia documents
- document clustering
- document analysis
- vector space model
- ranked list
- text retrieval
- vector space
- text documents
- user interface
- keywords