Real-Time Full-Text Clustering of Networked Documents.
Mehran SahamiSalim YusufaliMichelle Q. Wang BaldonadoPublished in: AAAI/IAAI (1997)
Keyphrases
- real time
- document clustering
- information retrieval systems
- journal articles
- retrieval systems
- clustering algorithm
- clustering method
- document collections
- information retrieval
- k means
- text clustering
- document classification
- self organizing maps
- low cost
- metadata
- digital libraries
- latent semantic analysis
- vision system
- keywords
- hierarchical clustering
- data objects
- unsupervised learning
- multimedia
- fuzzy clustering
- spectral clustering
- cosine similarity
- xml documents
- database
- web documents
- plain text
- document clusters
- mutual reinforcement
- document representation
- vector space model
- data clustering
- text documents
- relevant documents
- high dimensional data
- text categorization