On the Automatic Extraction of Data from the Hidden Web.
Stephen W. LiddleSai Ho YauDavid W. EmbleyPublished in: ER (Workshops) (2001)
Keyphrases
- automatic extraction
- data sets
- data analysis
- database
- synthetic data
- statistical analysis
- website
- essential information
- web sources
- raw data
- information sources
- data points
- data collection
- web content
- web data
- image data
- data extraction
- data sources
- digital libraries
- log data
- web logs
- increasing rapidly
- data quality
- data processing
- input data
- data mining techniques
- knowledge discovery
- end users
- natural language
- high quality
- web pages