Ré-ordonnancement pour l'apprentissage de transformations de documents HTML.
Guillaume WisniewskiPatrick GallinariPublished in: EGC (2007)
Keyphrases
- document type
- document structure
- information retrieval
- xml documents
- document collections
- information extraction
- information retrieval systems
- document classification
- web documents
- relevant documents
- text documents
- structured documents
- document retrieval
- semi structured
- extensible markup language
- metadata
- free text
- web pages
- document analysis
- document clustering
- electronic documents
- database
- user interface
- keywords
- web browser
- ranked list
- vector space model
- latent semantic analysis
- retrieval systems
- multimedia documents
- semantic information
- search engine
- text categorization
- html pages
- legal documents