A Latent Semantic Approach to XML Clustering by Content and Structure Based on Non-negative Matrix Factorization.
Gianni Costa, Riccardo Ortale
Published in: ICMLA (1) (2013)
Keyphrases
- content and structure
- non-negative matrix factorization
- probabilistic latent semantic analysis
- document clustering
- xml documents
- xml retrieval
- spectral clustering
- semi-structured
- xml queries
- clustering method
- sparse representation
- latent semantic analysis
- principal component analysis
- clustering algorithm
- document collections
- data clustering
- matrix factorization
- k-means
- xml data
- co-occurrence
- cluster analysis
- image classification
- information retrieval systems
- data points
- retrieval systems
- structured data
- document representation
- text mining
- information extraction
- data model