Anti-Serendipity: Finding Useless Documents and Similar Documents.
James W. Cooper, John M. Prager
Published in: HICSS (2000)
Keyphrases
- similar documents
- document collections
- textual documents
- document clustering
- relevant documents
- keywords
- semantic similarity
- document set
- text databases
- information retrieval systems
- inter document similarities
- knowledge extraction
- digital libraries
- test collection
- document representation
- document retrieval
- text retrieval
- machine learning
- clustering method
- text mining
- xml documents
- similarity measure
- information retrieval
- text documents
- tf idf
- user queries
- text summarization
- structured documents
- co occurrence
- semantic content