Partial data extraction via noisy histogram queries: Information theoretic bounds.
Wei-Ning ChenI-Hsiang WangPublished in: ISIT (2017)
Keyphrases
- information theoretic
- data extraction
- information theory
- mutual information
- query interface
- semi structured
- web data extraction
- query language
- data integration
- theoretic framework
- web databases
- jensen shannon divergence
- query processing
- information bottleneck
- information theoretic measures
- web search engines
- user queries
- data sources
- database
- kullback leibler divergence
- relative entropy
- entropy measure
- web pages
- structured data
- kl divergence
- text mining
- data model
- xml documents
- databases
- data sets