Corpus Based Unsupervised Labeling of Documents.
Delip Rao, Deepak P, Deepak Khemani
Published in: FLAIRS Conference (2006)
Keyphrases
- unsupervised learning
- document collections
- information retrieval systems
- topic modeling
- document clustering
- web documents
- information retrieval
- active learning
- text documents
- metadata
- image segmentation
- supervised learning
- semi supervised
- relevant documents
- unsupervised manner
- database
- structured documents
- document retrieval
- document classification
- digital documents
- labeling scheme
- document representation
- legal documents
- query terms
- vector space
- user queries
- test collection
- semantic information
- semi supervised learning
- query processing
- xml documents
- training data