Creating Large-scale Training and Test Corpora for Extracting Structured Data from the Web.
Robert MeuselHeiko PaulheimPublished in: LD4IE@ISWC (2015)
Keyphrases
- structured data
- semi structured data
- linked data
- unstructured information
- structured information
- textual data
- semi structured
- unstructured data
- text data
- information extraction
- web data
- unstructured text
- structured and unstructured data
- relational data
- web databases
- xml documents
- website
- web documents
- keyword search
- keyword queries
- structured databases
- data sources
- web mining
- metadata
- web pages
- information management
- natural language processing
- machine translation
- text mining
- decision trees
- graph based induction
- semistructured data
- search engine
- semantic web
- tree kernels
- query processing