A Clustering Approach for XML Linked Documents.
Barbara Catania, Anna Maddalena
Published in: DEXA Workshops (2002)
Keyphrases
- xml documents
- document clustering
- xml format
- semi structured documents
- metadata
- document structure
- text clustering
- information retrieval systems
- document centric
- clustering algorithm
- clustering method
- k means
- relational databases
- xml data
- structured documents
- xml schema
- document collections
- document representation
- extensible markup language
- document repository
- web documents
- free text
- database
- xml queries
- semi structured data
- hierarchical clustering
- spectral clustering
- content and structure
- information retrieval
- cosine similarity
- document analysis
- content similarity
- standard for data exchange
- document retrieval
- semantic information
- structured data
- object oriented
- databases
- xml retrieval
- markup language
- ranked list
- semi structured
- user queries
- data integration
- document type
- data model
- xml files