Finding Frequent Structural Features among Words in Tree-Structured Documents.
Tomoyuki UchidaTomonori MogawaYasuaki NakamuraPublished in: PAKDD (2004)
Keyphrases
- structural features
- structured documents
- document representation
- information retrieval systems
- structural information
- information retrieval
- web documents
- xml documents
- query language
- text documents
- vector space model
- secondary structure
- semantic features
- vector space
- bag of words
- n gram
- feature set
- keywords
- document collections
- database
- image classification
- language model
- knowledge discovery
- multiscale
- knowledge base
- data mining