HiLeX: A System for Semantic Information Extraction from Web Documents.
Massimo RuffoloMarco MannaPublished in: ICEIS (Selected Papers) (2006)
Keyphrases
- web documents
- information extraction
- unstructured documents
- natural language
- semantic association
- semi structured
- web search engines
- natural language processing
- linguistic patterns
- text mining
- document classification
- textual information
- vector space model
- named entities
- information retrieval
- structured data
- natural language text
- relation extraction
- question answering
- machine learning
- web content
- semantic similarity
- semantic web
- domain specific
- structured documents
- text documents
- keywords
- document representation
- web mining
- unstructured text
- word sense disambiguation
- query language
- semantic relationships
- wrapper generation
- website
- semantic information