Unsupervised extraction of text segments from heterogeneous document collections.
Hong Cui
Published in: ASIST (2010)
Keyphrases
- document collections
- text segments
- information retrieval systems
- document retrieval
- information retrieval
- test collection
- text retrieval
- text summarization
- relevant documents
- information extraction
- unsupervised learning
- digital libraries
- multiword
- scatter gather
- document representation
- document clustering
- text collections
- semi-supervised
- topic detection
- text data
- context sensitive
- feature selection
- language model
- data mining
- learning process
- document archives