Text-based measures of document diversity.
Kevin Bache, David Newman, Padhraic Smyth
Published in: KDD (2013)
Keyphrases
- semantic information
- textual features
- document classification
- document collections
- document retrieval
- document clustering
- information retrieval systems
- diversity measures
- keywords
- multimedia
- information retrieval
- web documents
- image search
- data sets
- social networks
- retrieval systems
- relevant documents
- machine learning
- data mining
- tf-idf
- structured documents
- neural network