Classifying Documents by Distributed P2P Clustering.
Martin EisenhardtWolfgang MüllerAndreas HenrichPublished in: GI Jahrestagung (2) (2003)
Keyphrases
- peer to peer
- document clustering
- scalable distributed
- distributed environment
- document collections
- distributed network
- text clustering
- clustering method
- clustering algorithm
- distributed systems
- information retrieval
- fully distributed
- k means
- information retrieval systems
- document classification
- cooperative
- xml documents
- document retrieval
- multi agent
- web documents
- cosine similarity
- peer to peer networks
- resource discovery
- data points
- data sharing
- query routing
- relevant documents
- text categorization
- range query processing
- data objects
- hierarchical clustering
- resource sharing
- vector space model
- metadata
- data clustering
- text documents
- mobile agents
- heterogeneous collections
- content similarity
- pre classified
- automatic text classification
- structured peer to peer
- distributed information retrieval
- peer to peer systems
- computing environments
- cluster analysis
- retrieval systems
- load balancing
- user queries
- wordnet
- unsupervised learning
- text mining
- digital libraries
- multimedia