Keyphrases
- information retrieval
- xml documents
- information retrieval systems
- document collections
- semantic structure
- structural information
- document structure
- relevant documents
- structured information
- document classification
- hierarchically organized
- machine learning
- logical structure
- content and structure
- vector space model
- web data
- document clustering
- vector space
- text documents
- retrieval systems
- digital libraries
- metadata
- feature selection