Costco: Robust Content and Structure Constrained Clustering of Networked Documents.
Su YanDongwon LeeAlex Hai WangPublished in: CICLing (2) (2011)
Keyphrases
- content and structure
- constrained clustering
- xml documents
- xml retrieval
- semi structured
- information retrieval
- xml queries
- document collections
- duplicate detection
- clustering method
- instance level constraints
- retrieval systems
- database
- information retrieval systems
- clustering algorithm
- databases
- hierarchical clustering
- vector space model
- multimedia
- metadata
- data mining