Vers une détection en temps réel de documents Web centrés sur une entité donnée.
Ludovic BonnefoyVincent BouvierRomain DeveaudPatrice BellotPublished in: CORIA (2013)
Keyphrases
- web documents
- web data
- web information
- multilingual documents
- website
- web pages
- text information
- digital documents
- open directory project
- focused crawling
- document collections
- electronic documents
- newspaper articles
- keywords
- information retrieval
- web applications
- information retrieval systems
- document classification
- textual data
- topic specific
- document retrieval
- web mining
- semi structured
- meta information
- web crawler
- structured information
- search interface
- web users
- text documents
- digital libraries
- content similarity
- web content
- information sources
- desired information
- document repositories
- answering questions
- current web search engines
- database
- search click data
- textual features
- web environment
- web queries
- document representation
- web search
- information extraction
- end users
- search engine