Combining Structure and Content Similarities for XML Document Clustering.
Tien Tran, Richi Nayak, Peter Bruza
Published in: AusDM (2008)
Keyphrases
- document clustering
- content and structure
- metadata
- document representation
- text mining
- document collections
- document structure
- xml documents
- clustering method
- clustering algorithm
- topic extraction
- vector space model
- web documents
- text documents
- latent semantic space
- ant based clustering
- multimedia
- test collection
- language model
- image features
- non-negative matrix factorization
- image retrieval
- bayesian networks
- real world
- databases