Transcluded Documents: Advantages of Reusing Document Fragments.
Harald Krottmaier
Published in: ELPUB (2002)
Keyphrases
- document collections
- relevant documents
- document clustering
- document classification
- electronic documents
- information retrieval systems
- web documents
- text documents
- information retrieval
- document content
- digital documents
- document representation
- document retrieval
- document processing
- retrieval systems
- semi structured documents
- document type
- document analysis
- document structure
- document set
- textual content
- structured documents
- document similarity
- unstructured documents
- document repository
- multimedia documents
- vector space model
- document ranking
- user queries
- retrieved documents
- similar documents
- document archives
- keywords
- document summarization
- keyword extraction
- digital libraries
- scientific documents
- index terms
- xml format
- related documents
- document centric
- document level
- textual documents
- query terms
- information extraction
- learning objects
- text mining
- document relevance
- topic hierarchy
- ranked list
- xml documents
- test collection
- scanned documents
- relevance ranking
- text classifiers
- query biased
- text collections
- latent semantic analysis
- maximal marginal relevance
- training documents
- document space
- semantic information
- text content
- printed documents
- logical structure
- term frequency
- retrieval strategies
- pdf documents
- text classification
- query expansion
- tf idf
- pdf files
- automatic text classification
- latent topics