XML-aided phrase indexing for hypertext documents.
Miro Lehtonen, Antoine Doucet
Published in: SIGIR (2008)
Keyphrases
- xml documents
- xml format
- semi structured documents
- document structure
- metadata
- document centric
- xml data
- structured documents
- document repository
- extensible markup language
- xml schema
- information retrieval
- html documents
- free text
- standard for data exchange
- xml databases
- semi structured data
- xml queries
- databases
- web documents
- relational databases
- semi structured
- xpath queries
- text documents
- document collections
- information retrieval systems
- content and structure
- electronic documents
- document management
- database
- data model
- relevant documents
- data interchange
- xml trees
- xml files
- document analysis
- data integration
- markup language
- document clustering
- vector space
- path expressions
- textual documents
- labeling scheme
- multi document summarization
- xml retrieval
- text mining
- xml fragments
- object oriented
- structured data
- semantic information
- digital libraries
- document retrieval