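Interprétation vague des contraintes structurelles pour la RI dans des corpus de documents XML - Évaluation d'une méthode approchée de RI structurée (Vague Interpretation of Structural Constraints for IR in XML Document Corpora - Evaluation of an Approximate Method for Structured IR)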
Eugen Popovici, Gildas Ménier, Pierre-Francois Marteau
Published in: CoRR (2008)
Keyphrases
- xml documents
- semi-structured documents
- xml format
- metadata
- information retrieval
- document structure
- document collections
- xml schema
- newspaper articles
- person names
- databases
- training corpus
- markup language
- document clustering
- document-centric
- xml data
- information retrieval systems
- word frequencies
- document repository
- electronic documents
- free text
- content and structure
- relational databases
- natural language text
- structured documents
- word pairs
- text corpora
- parallel corpora
- xpath expressions
- semi-structured data
- xml retrieval
- database
- information extraction
- web documents
- document retrieval
- standard for data exchange
- data interchange
- topic models
- semantic information
- user queries
- word frequency
- text documents
- wikipedia articles
- data exchange
- keyword search
- text collections
- xml queries
- multiword