Sélection par entropie de descripteurs textuels pour la catégorisation de documents XML.
Christine LargeronChristophe MoulinPublished in: EGC (2010)
Keyphrases
- xml documents
- xml format
- semi structured documents
- document centric
- metadata
- xml data
- document structure
- extensible markup language
- standard for data exchange
- document repository
- xml schema
- information retrieval systems
- electronic documents
- information retrieval
- data model
- structured documents
- xml queries
- xml databases
- semi structured data
- document clustering
- document management
- relational databases
- database
- document retrieval
- keyword search
- xml fragments
- structured data
- document collections
- markup language
- keywords
- document type
- data interchange
- databases
- document analysis
- xml retrieval
- vector space model
- semi structured
- data exchange
- text documents
- retrieval systems
- semantic information
- xml elements
- textual documents
- document representation
- xpath expressions
- document classification
- user queries
- web documents
- digital libraries