HCX: an efficient hybrid clustering approach for XML documents.
Sangeetha KuttyRichi NayakYuefeng LiPublished in: ACM Symposium on Document Engineering (2009)
Keyphrases
- xml documents
- tensor space model
- clustering method
- clustering algorithm
- k means
- relational databases
- xml data
- categorical data
- graph theoretic
- data clustering
- cluster analysis
- outlier detection
- xml databases
- relational data
- document clustering
- unsupervised learning
- semi structured
- data mining
- keyword search
- self organizing maps
- structured data
- xml information retrieval
- information retrieval