Extended VSM for XML Document Classification Using Frequent Subtrees.
Jianwu YangSonglin WangPublished in: INEX (2009)
Keyphrases
- document classification
- frequent subtrees
- xml documents
- text categorization
- web documents
- text classification
- classification algorithm
- text mining
- xml data
- vector space model
- tree mining
- xml queries
- metadata
- rooted trees
- unordered trees
- text documents
- database
- xml schema
- keyword search
- semi structured data
- mining frequent
- xml trees
- data model
- relational databases
- structured data
- tree structured data
- xml databases
- nearest neighbor
- xml retrieval
- training set