Keyphrases
- document collections
- information retrieval
- information retrieval systems
- metadata extraction
- text documents
- XML documents
- semi-automated
- keywords
- web documents
- document clustering
- fully automated
- text analysis
- semantic information
- legal documents
- document classification
- vector space model
- digital documents
- electronic documents
- plagiarism detection
- time-stamped
- retrieved documents
- data sets
- structured documents
- latent semantic indexing
- vector space
- document set
- latent semantic analysis
- automated analysis
- semantic relationships
- free text
- retrieval effectiveness
- query expansion