Cross-Instance Tuning of Unsupervised Document Clustering Algorithms.
Damianos G. KarakosJason EisnerSanjeev KhudanpurCarey E. PriebePublished in: HLT-NAACL (2007)
Keyphrases
- document clustering
- clustering algorithm
- unsupervised clustering
- tolerance rough set
- document collections
- semi supervised
- information retrieval systems
- text clustering
- k means
- unsupervised learning
- information retrieval
- keywords
- incremental clustering
- unsupervised manner
- document representation
- text mining
- supervised learning
- web documents
- image segmentation
- document analysis
- clustering method
- clustering quality
- retrieval systems
- structured documents
- supervised classification
- data clustering
- document images
- clustering framework
- density based clustering
- pairwise
- density based clustering algorithm
- document set
- clustering analysis
- topic modeling
- text summarization
- cluster analysis
- document retrieval
- relevant documents