Extraction de relations dans les documents Web [Relation extraction from Web documents].
Rémi Gilleron, Patrick Marty, Marc Tommasi, Fabien Torre
Published in: EGC (2006)
Keyphrases
- web documents
- web data
- information extraction
- multilingual documents
- web information extraction
- web information
- data extraction
- website
- textual data
- web pages
- structured information
- document classification
- link analysis
- document collections
- text information
- content similarity
- text documents
- open directory project
- extraction rules
- database
- information retrieval systems
- information retrieval
- document repositories
- digital documents
- newspaper articles
- logical structure
- metadata
- web content
- web applications
- keywords
- xml documents
- web queries
- semantic web
- topic specific
- linguistic analysis
- document structure
- web crawler
- google scholar
- current web search engines
- multimedia documents
- structured data
- vector space model
- user interests
- document retrieval
- text content
- web environment
- natural language processing
- relevant content
- relational databases
- user queries
- relevant documents
- web users