A platform for storing, visualizing, and interpreting collections of noisy documents.
Bart LamiroyDaniel P. LoprestiPublished in: AND (2010)
Keyphrases
- document collections
- information retrieval
- heterogeneous collections
- data collections
- metadata
- relevant documents
- text collections
- document retrieval
- similar documents
- digital libraries
- information retrieval systems
- document archives
- distributed information retrieval
- document classification
- text retrieval
- document clustering
- effective retrieval
- vector space model
- noisy data
- test collection
- data structure
- vector space
- text documents
- document representation
- digital objects
- xml documents
- real time
- retrieval model
- keywords
- legal documents
- electronic documents
- term weighting schemes
- trec collections
- web documents
- storage and retrieval
- database
- semantic information
- multi document summarization
- retrieved documents
- bibliographic databases
- document set
- retrieval systems
- document repositories