Unsupervised discovery and extraction of semi-structured regions in text via self-information.
Eric YehJohn NiekraszDayne FreitagPublished in: AKBC@CIKM (2013)
Keyphrases
- semi structured
- information extraction
- free text
- web documents
- semi structured data
- content and structure
- data collections
- structured data
- unstructured text
- information retrieval
- database
- semantic information
- text information
- information sources
- database systems
- text mining
- knowledge representation
- wrapper generation