Discovering Diverse and Salient Threads in Document Collections.
Jennifer Gillenwater, Alex Kulesza, Ben Taskar
Published in: EMNLP-CoNLL (2012)
Keyphrases
- document collections
- information retrieval systems
- document retrieval
- information retrieval
- text retrieval
- test collection
- document clustering
- document representation
- digital libraries
- cross language
- ad hoc retrieval
- topic detection
- relevant documents
- feature selection
- text data
- text collections
- index terms
- active learning
- databases
- xml retrieval
- data collections