A Probabilistic Approach for Adapting Information Extraction Wrappers and Discovering New Attributes.
Tak-Lam WongWai LamPublished in: ICDM (2004)
Keyphrases
- information extraction
- semi structured
- data extraction
- web information extraction
- precision and recall
- text mining
- free text
- natural language processing
- named entities
- machine learning
- information retrieval
- named entity recognition
- wrapper induction
- question answering
- structured data
- web documents
- relation extraction
- bayesian networks
- attribute values
- open domain
- uncertain data
- conditional random fields
- generative model
- probabilistic model
- text documents
- textual data
- ontology based information extraction
- web mining
- web sources
- text summarization
- word sense disambiguation
- posterior probability
- database
- data mining