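Vague interpretation of structural constraints for information retrieval (IR) in XML document corpora. Evaluation of an approximate method for structured IR. (Original French title: Interprétation vague des contraintes structurelles pour la RI dans des corpus de documents XML. Évaluation d'une méthode approchée de RI structurée.)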
Eugen Popovici, Gildas Ménier, Pierre-François Marteau
Published in: Document Numérique (2007)
Keyphrases
- xml documents
- xml format
- semi-structured documents
- metadata
- document-centric
- document structure
- information retrieval
- databases
- word frequencies
- xml data
- newspaper articles
- free text
- xml schema
- semi-structured
- document retrieval
- person names
- document collections
- xml queries
- information retrieval systems
- text data
- document representation
- structured documents
- document corpus
- data exchange
- training corpus
- content and structure
- text corpus
- relational databases
- document repository
- web documents
- relevant documents
- extensible markup language
- keyword search
- document level
- xpath queries
- database
- document clustering
- machine translation
- word frequency
- sentence level
- text corpora
- xml retrieval
- markup language
- vector space model
- standard for data exchange