Linking Archives Using Document Enrichment and Term Selection.
Marc Bron, Bouke Huurnink, Maarten de Rijke
Published in: TPDL (2011)
Keyphrases
- term selection
- text categorization
- ad hoc retrieval
- query expansion
- information retrieval
- relevant documents
- document frequency
- pseudo relevance feedback
- text classification
- information retrieval systems
- document collections
- metadata
- document retrieval
- text documents
- digital libraries
- relevance feedback
- information gain
- expansion terms
- selection mechanism
- document clustering
- retrieval systems
- retrieved documents
- vector space model
- knn
- clustering algorithm
- semi supervised learning
- decision trees
- term frequency
- tf idf
- ranked list
- text mining
- web search
- web documents
- language model
- k nearest neighbor