Semantic Document Processing Using Wikipedia as a Knowledge Base.
Ian H. WittenPublished in: INEX (2009)
Keyphrases
- document processing
- knowledge base
- digital libraries
- document images
- information retrieval
- natural language
- information extraction
- knowledge representation
- multimedia documents
- document clustering
- text processing
- high level
- document analysis
- databases
- textual documents
- html documents
- semantic content
- semantic similarity
- domain ontology
- website
- semantic information
- wordnet
- semantic web
- knowledge discovery
- image analysis