Keyphrases
- structured documents
- electronic documents
- information retrieval
- xml documents
- document collections
- metadata
- document classification
- information retrieval systems
- web documents
- relevant documents
- document representation
- logical structure
- extensible markup language
- neural network
- digital documents
- textual content
- document analysis
- markup language
- web data
- document clustering
- clustering algorithm