Effective Document Clustering for Large Heterogeneous Law Firm Collections.
Jack G. ConradKhalid Al-KofahiYing ZhaoGeorge KarypisPublished in: ICAIL (2005)
Keyphrases
- document clustering
- document collections
- text mining
- clustering algorithm
- clustering method
- negative matrix factorization
- text documents
- information retrieval
- vector space model
- document representation
- similar documents
- machine learning
- pairwise constraints
- clustering approaches
- tf idf
- real world
- tolerance rough set
- cluster analysis
- unsupervised learning
- information extraction
- k means
- digital libraries
- metadata
- data mining