Overview of the INEX 2009 XML Mining Track: Clustering and Classification of XML Documents.
Richi NayakChristopher M. De VriesSangeetha KuttyShlomo GevaLudovic DenoyerPatrick GallinariPublished in: INEX (2009)
Keyphrases
- xml documents
- tensor space model
- xml data
- xml databases
- xml queries
- unsupervised learning
- relational databases
- data model
- querying xml documents
- xml schema
- clustering analysis
- clustering algorithm
- relational data
- clustering method
- keyword search
- xml trees
- xml information retrieval
- text mining
- structured data
- decision trees
- xpath queries
- semi structured data
- labeling scheme
- content and structure
- document structure
- feature selection
- metadata
- xml data sources
- document centric
- native xml
- xml format
- feature space
- query language
- xml fragments
- support vector machine
- sql queries
- regular expressions
- data mining
- data exchange
- document clustering