A Template-Based Information Extraction from Web Sites with Unstable Markup.
Maxim Kolchin, Fedor Kozlov
Published in: SemWebEval@ESWC (2014)
Keyphrases
- information extraction
- website
- wrapper induction
- precision and recall
- natural language processing
- web pages
- semi-structured
- text mining
- machine learning
- information retrieval
- named entity recognition
- named entities
- web usage mining
- structured data
- free text
- web mining
- relation extraction
- web usability
- markup language
- semantic tagging
- web users
- ontology-based information extraction
- conditional random fields
- machine translation
- web documents
- data extraction
- web usage
- question answering
- co-occurrence
- data mining
- web server
- natural language
- association rules
- information extraction systems
- dynamically generated
- web objects
- open domain
- text processing
- textual data
- text documents
- word sense disambiguation