Novelty-based Incremental Document Clustering for On-line Documents.
Sophoin Khy, Yoshiharu Ishikawa, Hiroyuki Kitagawa
Published in: ICDE Workshops (2006)
Keyphrases
- document clustering
- document collections
- text documents
- document representation
- document clusters
- clustering method
- vector space model
- document similarity
- clustering algorithm
- text mining
- text clustering
- topic extraction
- tf-idf
- cosine similarity
- similar documents
- automatic categorization
- document set
- k-means
- topic detection
- document categorization
- tolerance rough set
- cluster analysis
- document retrieval
- data analysis
- information retrieval
- text analysis
- relevant documents
- knowledge representation