Endless and Scalable Knowledge Table Extraction from Semi-structured Websites.
Yingqin GuLei JiZiheng JiangJun HePublished in: ICDM Workshops (2012)
Keyphrases
- semi structured
- structured knowledge
- knowledge rich
- structured data
- web scale
- wrapper induction
- data model
- information extraction
- domain knowledge
- web sources
- semi structured data
- information integration
- free text
- web documents
- web data extraction
- web data
- text mining
- data extraction
- knowledge representation
- data collections
- unstructured text
- knowledge acquisition
- knowledge discovery
- wrapper generation
- relational databases
- content and structure
- knowledge representation and reasoning
- xml databases
- html pages
- multi view
- data mining techniques
- active learning
- xml documents
- database
- web data sources