Document Listing on Repetitive Collections.
Travis GagieKalle KarhuGonzalo NavarroSimon J. PuglisiJouni SirénPublished in: CPM (2013)
Keyphrases
- document collections
- information retrieval
- information retrieval systems
- text collections
- document images
- term weighting schemes
- similar documents
- document retrieval
- document archives
- keywords
- relevant documents
- document classification
- tf idf
- web search
- digital libraries
- data sets
- retrieval systems
- document representation
- database
- information extraction
- data mining
- textual documents
- document analysis
- search engine
- document clustering
- feature selection
- text documents
- multimedia
- test collection