On Precision and Recall of Multi-Attribute Data Extraction from Semistructured Sources.
Guizhen YangSaikat MukherjeeI. V. RamakrishnanPublished in: ICDM (2003)
Keyphrases
- precision and recall
- data extraction
- multi attribute
- semi structured
- information extraction
- web sources
- web data sources
- structured data
- web data extraction
- utility function
- information integration
- data model
- web data
- web documents
- text mining
- data sources
- multi dimensional
- information retrieval
- semistructured data
- information sources
- html pages
- machine learning
- databases
- tree structured patterns
- attribute values
- data integration
- natural language processing
- web mining
- database
- semistructured databases
- natural language
- web pages
- decision making
- semistructured documents