Keyphrases
- information retrieval
- document collections
- text analysis
- metadata
- xml documents
- information retrieval systems
- web documents
- relevant documents
- document classification
- legal documents
- plagiarism detection
- document analysis
- document retrieval
- latent semantic analysis
- document clustering
- structured documents
- free text
- web data
- semantic information
- vector space model
- document representation
- neural network
- query terms
- document set
- web search engines
- document structure
- search engine
- machine learning