Generating summary documents for a variable-quality PDF document collection.
Jacob Hughes, David F. Brailsford, Steven R. Bagley, Clive E. Adams
Published in: ACM Symposium on Document Engineering (2014)
Keyphrases
- document collections
- information retrieval systems
- information retrieval
- document retrieval
- document summaries
- relevant documents
- text retrieval
- test collection
- document representation
- document clustering
- index terms
- document archives
- digital libraries
- text collections
- document set
- cross language
- ad hoc retrieval
- similar documents
- document clusters
- databases
- retrieved documents
- XML retrieval
- documents retrieved