Unsupervised Structured Data Extraction from Template-generated Web Pages.
Tomas GrigalisAntanas CenysPublished in: J. Univers. Comput. Sci. (2014)
Keyphrases
- structured data
- information extraction
- web pages
- structured information
- semi structured
- data extraction
- unstructured data
- unstructured information
- semi structured data
- free text
- website
- search engine
- web documents
- textual data
- data sources
- keyword search
- data records
- web search engines
- web databases
- xml documents
- linked data
- metadata
- keywords
- unstructured text
- link analysis
- deep web
- structured databases
- web data
- graph kernels
- page contents
- semistructured data
- natural language processing
- machine learning
- web search
- semi supervised
- information retrieval
- databases
- structured and unstructured data
- data sets