STEWARD: demo of spatio-textual extraction on the web aiding the retrieval of documents.
Hanan SametMichael D. LiebermanJagan SankaranarayananJon SperlingPublished in: DG.O (2007)
Keyphrases
- web documents
- current web search engines
- content similarity
- information retrieval
- textual data
- information retrieval systems
- document retrieval
- web data
- document structure
- multimedia documents
- textual features
- information extraction
- text content
- retrieval systems
- web retrieval
- content extraction
- multilingual documents
- html pages
- search interface
- website
- data extraction
- multimedia information
- search engine
- keywords
- web information
- structured documents
- document collections
- web search
- web pages
- retrieval process
- content and structure
- textual case based reasoning
- textual information
- retrieval model
- current search engines
- expert finding
- query terms
- free text
- metadata
- document analysis
- test collection
- xml documents
- query expansion
- relevant documents
- handwritten documents
- retrieval engine
- web mining
- open directory project
- text retrieval
- document clustering
- multimedia
- term frequency
- document repositories
- keyword queries
- document content
- semantic web
- web content
- natural language
- image retrieval
- topic distillation
- structured data
- textual contents
- retrieval strategies
- retrieval effectiveness
- plain text
- related documents
- user queries
- xml retrieval
- textual descriptions
- text documents