Scalable Attribute-Value Extraction from Semi-structured Text.
Yuk Wah WongDominic WiddowsTom LokovicKamal NigamPublished in: ICDM Workshops (2009)
Keyphrases
- semi structured
- attribute values
- information extraction
- free text
- data extraction
- text mining
- web documents
- structured data
- web scale
- unstructured text
- content and structure
- web data extraction
- information retrieval
- web information extraction
- html pages
- textual data
- semi structured data
- text documents
- information integration
- data model
- web data
- numerical attributes
- structured knowledge
- categorical data
- semi structured documents
- wrapper generation
- wrapper induction
- web data sources
- knowledge rich
- attribute value pairs
- multiple attribute decision making
- web sources
- natural language processing
- unstructured data
- data collections
- automatic extraction
- semantic information
- keywords
- databases
- continuous attributes
- textual information
- extraction patterns
- machine learning