Toward a Retrieval of HTML Documents using a Semantic Approach.
Fernando FerriCristina GhiselliPatrizia GrifoniMarco PadulaPublished in: IEEE International Conference on Multimedia and Expo (III) (2000)
Keyphrases
- html documents
- web page retrieval
- semantic information
- content extraction
- structured documents
- web documents
- information retrieval
- automatic extraction
- topic maps
- language model
- web pages
- query expansion
- high level
- semi structured
- semistructured data
- retrieval model
- document retrieval
- relevance feedback
- image retrieval
- semantic features
- repeated patterns
- semantic similarity
- test collection
- domain ontology
- information retrieval systems
- natural language processing
- domain knowledge
- xml documents
- keywords
- web content
- low level features
- database
- wordnet
- web search
- information extraction
- digital libraries